Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 1 February 2008 (MN I^TeX style file v2.2) 



The 2dF Galaxy Redshift Survey: Spectra and redshifts 



o 
o 



Matthew Colless^, Gavin Dalton^, Steve Maddox^, Will Sutherland'*, Peder 
Norberg^, Shaun Cole^, Joss Bland-Hawthorn^, Terry Bridges^, Russell Cannon^, 
Chris Collins^, Warrick Couch^, Nicholas Cross^, Kathryn Deeley^, Roberto De 
Propris^, Simon P. Driver'*, George Efstathiou^, Richard S. EUis*'^, Carlos S. Frenk^, 
^ ■ Karl Glazebrook**, Carole Jackson*, Ofer Lahav^, Ian Lewis^, Stuart Lumsden*^, 
; Darren Madgwick^, John A. Peacock*^, Bruce A. Peterson*, Ian Price*, Mark 
Q ; Seaborne^, Keith Taylor^'*^ (the 2dFGRS team) 

C D ' ^Research School of Astronomy & Astrophysics, The Australian National University, Weston Creek, ACT 2611, Australia 
T~H '^Department of Physics, University of Oxford, Keble Road, Oxford OXl 3RH, UK 

^School of Physics & Astronomy, University of Nottingham, Nottingham NG7 2RD, UK 
' School of Physics and Astronomy, University of St Andrews, North Haugh, St Andrews, Fife, KY6 OSS, UK 
^ ^Department of Physics, University of Durham, South Road, Durham DHl 3LE, UK 
OO , ^ Anglo- Australian Observatory, P.O. Box 296, Epping, NSW 2121, Australia 

. "^Astrophysics Research Institute, Liverpool John Moores University, Twelve Quays House, Birkenhead, L14 ILD, UK 
i ^Department of Astrophysics, University of New South Wales, Sydney, NSW 2052, Australia 
' '^Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CBS OHA, UK 
' ^'^ Department of Astronomy, California Institute of Technology, Pasadena, CA 91125, USA 

Department of Physics & Astronomy, Johns Hopkins University, Baltimore, MD 21218-2686, USA 
Department of Physics, University of Leeds, Woodhouse Lane, Leeds, LS2 9JT, UK 
^ Institute for Astronomy, University of Edinburgh, Royal Observatory, Blackford Hill, Edinburgh EH9 3HJ, UK 



o 



Accepted — . Received — ; in original form 



ABSTRACT 

. . The 2dF Galaxy Redshift Survey (2dFGRS) is designed to measure redshifts for ap- 

I proximately 250 000 galaxies. This paper describes the survey design, the spectroscopic 

^ . observations, the redshift measurements and the survey database. The 2dFGRS uses 

the 2dF multi-fibre spectrograph on the Anglo- Australian Telescope, which is capable 
of observing 400 objects simultaneously over a 2° diameter field. The source catalogue 
for the survey is a revised and extended version of the APM galaxy catalogue, and the 
targets are galaxies with extinction-corrected magnitudes brighter than bj=19.45. The 
main survey regions are two declination strips, one in the southern Galactic hemisphere 
spanning 80°xl5° around the SGP, and the other in the northern Galactic hemisphere 
spanning 75°xl0° along the celestial equator; in addition, there are 99 fields spread 
over the southern Galactic cap. The survey covers 2000 deg^ and has a median depth 
of z=0.11. Adaptive tiling is used to give a highly uniform sampling rate of 93% over 
the whole survey region. Redshifts are measured from spectra covering 3600-8000A at 
a two-pixel resolution of 9.0A and a median S/N of ISpixel^^. All redshift identifica- 
tions are visually checked and assigned a quality parameter Q in the range 1-5; Q>3 
redshifts are 98.4% reliable and have an rms uncertainty of 85kms~^. The overall 
redshift completeness for Q>3 redshifts is 91.8%, but this varies with magnitude from 
99% for the brightest galaxies to 90% for objects at the survey limit. The 2 dFGRS 



database is available on the WWW at http://www.mso.anu.edu.au/2dFGRS 
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1 INTRODUCTION 

The 2dF Galaxy Redshift Survey is designed to measure red- 
shifts for approximately 250 000 galaxies in order to achieve 
an order of magnitude improvement on previous redshift sur- 
veys, and provide a detailed and representative picture of the 
galaxy population and its large-scale structure in the nearby 
universe. The main goals of the 2dFGRS are: 

1. To measure the galaxy power spectrum P{k) on scales 
up to a few hundred Mpc, filling the gap between the small 
scales where P{k) is known from previous galaxy redshift 
surveys and the largest scales where -P(fe) is well-determined 
by observations of the cosmic microwave background (CMB) 
anisotropies. Particular goals are to determine the scale of 
the turnover in the power spectrum and to observe in the 
galaxy distribution the acoustic peaks detected in the CMB 
power spectrum (Percival et al. 2001). 

2. To measure the redshift-space distortion of the large- 
scale clustering that results from the peculiar velocity field 
produced by the mass distribution (Peacock et al. 2001). 
This distortion depends on both the mass density parame- 
ter SI and the bias factor b of the galaxy distribution with 
respect to the mass distribution, constraining the combina- 
tion p = Q°'^/b. 

3. To measure higher-order clustering statistics of the 
galaxy distribution in order to: (a) determine the bias pa- 
rameter b, revealing the relationship between the distribu- 
tions of mass and light and yielding a direct measure of 
Q; (b) establish whether the galaxy distribution on large 
scales is a Gaussian random field, as predicted by most in- 
fiationary models of the early universe; and (c) investigate 
the non-linear growth of clustering in the small-scale galaxy 
distribution. 

4. To fully and precisely characterise the galaxy pop- 
ulation in terms of the distributions of fundamental prop- 
erties such as luminosity, surface brightness, spectral type 
and star-formation rate (Folkes et al. 1999; Cole et al. 2001; 
Madgwick et al. 2001a; Cross et al. 2001). 

5. To quantify the relationships between the internal 
properties of galaxies (such as luminosity, spectral type and 
star-formation rate) and their external environment (the lo- 
cal density of galaxies and the surrounding large-scale struc- 
ture) in order to constrain models of galaxy formation and 
evolution (Norberg et al. 2001a). 

6. To investigate the properties of galaxy groups and 
clusters, not just from existing cluster catalogues (De Pro- 
pris et al. 2001), but by defining a large, homogeneous sam- 
ple of groups and clusters in redshift space, avoiding the 
problems of cluster catalogues defined in projection and al- 
lowing detailed study of the mass distributions and dynam- 
ical evolution in the most massive bound structures in the 
universe. 

7. To provide a massive spectroscopic database for use 
in conjunction with other surveys, for finding rare and inter- 
esting types of object, and as a source for a wide variety of 
follow-up observations (Cole et al. 2001; Sadler et al. 2001, 
Magliocchetti et al. 2001). 

This paper provides an overview of the survey and de- 
tailed description of the survey observations. The layout of 
the papers is as follows: §2 summarises the main capabilities 
of the 2dF multi-fibre spectrograph; §3 describes the input 
source catalogue for the survey; §4 discusses the survey de- 



sign, including the areas of sky covered by the survey and 
the tiling of the survey fields; §5 outlines the algorithm used 
to assign fibres to targets, and its uniformity and complete- 
ness; §6 describes the spectroscopic data obtained for the 
survey and the data reduction methods; §7 deals with the 
estimation of the redshifts and various internal and exter- 
nal checks on their reliability and precision; §8 describes the 
survey masks, which encapsulate the coverage, magnitude 
limits and redshift completeness of the survey; §9 outlines 
the main components and features of the survey database; 
§10 summarises some of the main results emerging from the 
survey; and §11 provides conclusions. 



2 THE 2DF SPECTROGRAPH 

The survey is designed around the Two-degree Field (2dF) 
multi-fibre spectrograph on the Anglo- Australian Telescope, 
which is capable of observing up to 400 objects simultane- 
ously over a 2° diameter field of view. Here we summarise 
the aspects of the instrument relevant to the 2dFGRS; a full 
description is provided by Lewis et al. (20 01) and the 2dF 
User Manual (jhttp: / /www. aao.gov.au/2di ) . The four main 



components of 2dF are the corrector optics, the fibre posi- 
tioner, the fibres and the spectrographs. 

The 2dF corrector optics are a four-element lens assem- 
bly which gives 1" images over a 2.1° diameter flat field of 
view at the prime focus of the AAT. The corrector incorpo- 
rates an atmospheric dispersion compensator (ADC) which 
corrects for atmospheric dispersion at zenith distances less 
than about 70° . The most signiflcant remaining image degra- 
dation is a chromatic variation in distortion, which has the 
effect of dispersing images in the radial direction. This effect 
is largest halfway out from the fleld centre, where the wave- 
length range 0.35-1.0/im is dispersed over 2" radially. The 
radial distortion introduced by the corrector gives an image 
scale that varies over the field of view, from 15.5"/iiim at the 
centre to 14.2"/mm at the edge; the corresponding change 
in focal ratio is from f/3.4 to f/3.7. 

The 2dF flbre positioner X-Y robot takes about 6-7 sec- 
onds on average to position one fibre. Since approximately 
550-580 fibre moves are required to re-configure a typical 
400-fibre field, this means that a full re-configuration takes 
about 60-65 minutes. The actual configuration time varies 
depending on the number of fibres used and the complexity 
of the field. To avoid dead-time there are two field plates, 
each with 400 fibres. While one field plate is placed at the 
focal plane for observing, the other is being re-configured 
by the fibre positioner. The positions of the field plates are 
reversed by tumbling them about their horizontal axis. 

The fibres are have 140/im diameter cores, correspond- 
ing to 2.16" at the field centre and 1.99" at the field edge. 
There are 400 fibres on each field plate. The internal preci- 
sion with which the fibres are positioned is ll/xm (0.16") on 
average, with no fibres outside 20fim (0.3"). The fibres ter- 
minate in magnetic 'buttons' that attach to the steel field 
plates. Each button has a right-angle prism which directs 
the light from the focal plane into the fibre. To prevent fibre 
buttons coming into direct contact and protect the fragile 
prisms, the fibres cannot be placed too close together. The 
absolute minimum fibre separation is 800^m (approximately 
12"), and the required separation is generally larger (approx- 
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imately 30"), as it depends on the detailed geometry of the 
buttons and their relative orientations. 

Half the fibres on each field plate go to one of the two 
identical spectrographs. Each spectrograph has an f/3.15 off- 
axis Maksutov collimator feeding a 150mm coUimated beam 
to the grating and then, at an angle of 40°, to an f/1.2 wide- 
field Schmidt camera. The detectors are Tektronix CCDs 
with 1024x1024 24^m pixels. The 200 fibre spectra imaged 
onto each CCD are separated by approximately 5 pixels. 
The 2dFGRS used the 300B gratings, which are blazed at 
4200A and give a dispersion of 178.8A/mm (4.3A/pixel). For 
a typical spectrograph focus of 2.1 pixels (FWHM), this cor- 
responds to a FWHM spectral resolution of 9.0A. The 2dF- 
GRS observations used a central wavelength around 5800A, 
and covered the approximate range 3600-8000 A. Flexure is 
less than 0.2 pixel/hour (i.e. less than 40kms~^ over a typ- 
ical integration time). 

The overall system efficiency (source to detector) of 
2dF with the 300B gratings used in the 2dFGRS is 2.8% 
at 4400A, 4.3% at 5500A and 4.7% at 7000A. These figures 
were obtained from measurements of photometric standard 
stars, corrected to nominal 1" seeing. 



3 SOURCE CATALOGUE 

The source catalogue for the survey (Maddox et al. 2001, 
in preparation) is a revised and extended version of the 
APM galaxy catalogue (Maddox et al. 1990a,b,c;1996). This 
catalogue is based on Automated Plate Measuring machine 
(APM) scans of 390 plates from the UK Schmidt Telescope 
(UKST) Southern Sky Survey. The extended version of the 
APM catalogue includes over 5 million galaxies down to 
bj=20.5 in both north and south Galactic hemispheres over 
a region of almost 10'* deg^ (bounded approximately by dec- 
lination 5<+3° and Galactic latitude ti>30°). Small regions 
around bright stars, satellite trails and plate fiaws are ex- 
cluded; these are accounted for by the survey mask (see 

The b.i magnitude system for the Southern Sky Survey 
is defined by the response of Kodak HIaJ emulsion in combi- 
nation with a GG395 filter. It is zeropointed to Vega I.e. Dj 
is equal to Johnson B for an object with zero colour in the 
Johnson-Cousins system. The colour equation is normally 
taken to be 



b, = B - 0.28(B - V) 



(1) 



following Blair & Gilmore (1982). A larger coefficient 
(—0.35) has been suggested by Metcalfe et al. (1995), but 
we measure — 0.27±0.02 in comparison with the ESO Imag- 
ing Survey (Arnouts et al. 2001), and we therefore retain 
the usual value of —0.28. The photometry of the catalogue 
is calibrated with numerous CCD sequences and, for galax- 
ies with b,i = 17-19.45, has a 68% spread of approximately 
0.15 mag, but with a non-Gaussian tail to the error distri- 
bution. We emphasise that the calibration is to total CCD 
photometry, which absorbs any remaining correction to the 
thresholded APM magnitudes. 

The star-galaxy separation is as described in Maddox 
et al. (1990b), and the locus dividing stars and galaxies was 
chosen to exclude as few compact galaxies as possible while 
keeping the contamination of the galaxy sample by stars to 



about 5%. Spectroscopic identifications of the survey objects 
(see §^), show that the stellar contamination is in fact 6%. 

The source catalogue is incomplete at all magnitudes 
due to various effects, including the explicit exclusion of ob- 
jects classified by the APM as merged images, the misclas- 
sification of some galaxies as stars, and the non-detection 
(or misclassification as noise) of some low surface brightness 
objects. This incompleteness has been studied in compar- 
isons with deeper wide-area CCD photometry by Pimbblet 
et al. (2001) and Cross & Driver (2001, in preparation). The 
overall level of incompleteness is 10-15% and varies slightly 
with apparent magnitude, being largest for the brightest 
and faintest objects. The main classes of objects that are 
excluded are: (i) merged galaxy images that are explicitly 
excluded from the 2dFGRS source catalogue (about 60% of 
the missing objects) ; (ii) large galaxies that are resolved into 
components that are classified as stellar, merged or noise ob- 
jects (20%); (iii) compact normal galaxies that are detected 
but classified as stars (15%); and (iv) low surface brightness 
galaxies that are either not detected or classified as noise 
objects (5%). Thus the main cause of incompleteness is mis- 
classification of objects rather than their non-detection. 

The target galaxies for the 2dFGRS were selected 
to have extinction-corrected magnitudes brighter than 
bj=19.45. Since the targets were selected, improvements to 
the photometric calibrations and revised extinction correc- 
tions have resulted in slight variations to the magnitude limit 
over the survey regions — this is precisely quantified by the 
magnitude limit mask for the survey (see §?.l). The bj ex- 
tinction is taken to be Ab j==4:.Q35E{B—V), where the co- 
efficient, and the reddening E{B—V) as a function of posi- 
tion, come from Schlegel et al. (1998). The limit of b,, = 19.45 
was chosen because: (i) The surface density of galaxies at 



bj=19.45 (approximately 165 deg 



sufficiently larger 



than the surface density of 2dF fibres on the sky (127deg~^) 
to allow efficient use of all fibres — few fibres are unused even 
in low-density fields, (ii) The time taken to configure a typ- 
ical field (60-65 min) allows, with overheads, a sufficiently 
long exposure time to reach the desired signal-to-noise level 
of 5/A''>10pixel~^ for galaxies with b.j=19.45 even in rather 
poor conditions. This limiting magnitude corresponds to a 
median redshift for the survey of about z=0.1, so that the 
2dFGRS is essentially a survey of the local universe. 



4 SURVEY DESIGN 
4.1 Survey areas 

The areas of the sky covered by the survey were chosen so 
as to satisfy a number of different requirements. The first 
goal was to cover as large a volume as possible, in order to 
closely approach a statistically representative sample of the 
universe on the largest possible scales. The second was to 
obtain near-complete sampling down to the survey limit in 
order to have the finest possible resolution of structure on 
small scales. The third requirement was to match the sample 
to the observational capabilities of the 2dF instrument in 
order to achieve high efficiency. The adopted geometry is an 
effective compromise between these requirements. 

The survey consists of two separate declination strips 
of overlapping 2° fields plus 99 scattered 'random' 2° fields. 
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Figure 1. The 2dFGRS regions sliown in an AitofT projection of R.A. and Dec, with individual 2dF fields marked as small circles. Also 
shown are the lines of Galactic latitude |fe[=0°, 30°, 45°. The numbers of survey galaxies in these regions are: 193 550 in the 643 fields of 
the 80° X 15° SOP strip, 139 144 in the 450 fields of the 75° X 10° NGP strip, and 57 019 in the 99 fields scattered around the SOP strip. 





Figure 2. The survey fields in the NGP (loft) and SGP (right) on maps of the extinction A;, derived from Schlegel et al. (1998). 



One strip (the SGP strip) is in the southern Galactic hemi- 
sphere and covers approximately 80° x 15° centred close to 
the South Galactic Pole (21'' 40"° < a < 03'' 40™, -37.5° < 
5 < -22.5°). The other strip (the NGP strip) is in the north- 
ern Galactic hemisphere and covers 75°xl0° (09''50'" < Q < 
14''50'", -7.5° < <5 < -h2.5°). The 99 'random' fields are cho- 
sen from the low-extinction region of the APM catalogue in 



the southern Galactic hemisphere outside the survey strip 
(the mean extinction over each field is required to be less 
than 0.2 mag— see Figure 2 of Efstathiou & Moody (2001)). 
The fields are chosen pseudo-randomly within this region, 
except that the field centres are at least 3° apart. A map 
of the survey fields on the sky is shown in Figure |l| the 
locations of the fields with respect to the extinction map 
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\ (mag) 

Figure 3. The distribution of extinction corrections Af,j with 
Galactic latitude b (dots and left axis) and the fraction of correc- 
tions larger than a given value (line and right axis). 



derived from Schlegel et al. (1998) are shown in Figure g 
All the survey fields lie at Galactic latitudes greater than 
|&|=30°, and the whole of the SGP strip and most of the 
NGP strip and the random fields lie at Galactic latitudes 
greater than |6|=45°. 

The distribution of extinction corrections as a func- 
tion of Galactic latitude, and the fraction of corrections 
larger than a given value, are shown in Figure ^. Over- 
all, the median correction is 0.07 mag, 90% are less than 
0.16 mag, and 99% are less than 0.26 mag; the corre- 
sponding quantiles in the NGP are (0.12,0.19,0.28) mag, 
in the SGP (0.05,0.07,0.11) mag and in the random fields 
(0.07,0.13,0.30) mag. 

The 2dFGRS target sample of galaxies contains 193 550 
galaxies in the SGP strip, 139 144 galaxies in the NGP strip, 
and 57019 galaxies in the random fields. This gives a total 
of 389 713 possible targets, significantly more than the sur- 
vey goal of 250 000 galaxies. Survey observations of the NGP 
and SGP strips are proceeding outwards in declination from 
the centre of each strip towards this goal. Note that the to- 
tal number of galaxies listed in the survey source catalogue 
(and the survey database) is 467 214, which is larger than 
the number of possible survey targets because the source 
catalogues for the NGP and SGP strips conservatively in- 
cludes galaxies fainter than the spectroscopic survey magni- 
tude limit, down to b,i = 19.6. 

At the median redshift of the survey {z—0.11) the 
SGP strip extends over 400 /i"^ Mpcx75 /i"^ Mpc, and the 
NGP strip over 375 Mpcx50 h"^ Mpc. Out to the effec- 
tive limit of the survey at 2~0.3, the strips contain a vol- 
ume of 1.2x10* h-^ Mpc='(for n^=0.3, D.a=0.7); the volume 
sparsely sampled including the random fields is between two 
and three times larger. 



4.2 Tiling the survey 

The survey limit of b,i=19.45 was chosen, in part, because 
it gives a good match between the surface density of galax- 
ies and the surface density of 2dF fibres. Due to clustering, 
however, the number of galaxies in a given field varies con- 
siderably. The rms variation in the number of galaxies per 
randomly-placed 2° field is 140 at bj = 19.5, and is largely 
independent of the choice of magnitude limit over the range 
considered here. To make efficient use of 2dF we therefore 
require an algorithm for tiling the sky with 2° fields that 
allows us to cover the survey area at a high, and nearly 
uniform, sampling rate with the minimum number of 2dF 
fields. 

So long as the sampling of the source catalogue is not 
biased in any way that depends on the photometric or spec- 
troscopic properties of the galaxies, we can always use the 
source catalogue to accurately determine the sampling rate 
as a function of position (see §^). The sampling can then be 
accounted for in any analysis. However to keep such correc- 
tions to a minimum, considerable effort has been invested in 
making the sampling as complete and uniform as possible. 

There are a number of possible approaches to laying 
down target field centres. The simplest is to adopt a uniform 
grid of equally spaced centres and then either randomly sam- 
ple each field with the number of available fibres or observe 
each field several times until all the galaxies have been ob- 
served. The second of these options is clearly inefficient, as it 
will give rise to a large number of fields being observed with 
significantly less than the full complement of fibres, while 
the first is undesirable as it gives a different sampling factor 
for each field. A more efficient solution is to use an adaptive 
tiling strategy, where we allow each field centre to drift from 
the regular grid so that we maximise the number of targets 
that are assigned to each field, subject to the constraint that 
the number of targets assigned to any one field should not 
exceed the number of available fibres, Nf. 

We begin with a uniform grid with field centres equally 
spaced by s = \/3° in right ascension along rows 1.5° apart 
in declination. For each galaxy in the survey we then deter- 
mine how many fields it lies within, and assign each field a 
weight Wi, where 

w,= if N,> Nf (2) 
= 1-^ ffiV. <iV^ 

and Ni is the number of galaxies in this field. These weights 
are normalised such that X^^* Wi = 1, where Nt is the num- 
ber of fields that this galaxy could be assigned to. The galaxy 
in question is then randomly assigned to one of the fields 
using these weights, unless all the fields are already filled, 
in which case it is assigned to the first field in which it 
was found. Once all the field occupancies have been deter- 
mined in this way we move each field in right ascension by 
an amount 

Sm^ Aa + 0.05s^ i^{N^>Nf) (3) 

= Aa-s'(l-^) ff(iV, <7V;) 

where s is the maximum allowed separation between adja- 
cent fields, s' is the current distance to the neighbouring 
field centre, and Aa is the cumulative shift that has cur- 
rently been applied to this row. New fields are added at 
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the end of each row if the total length of the row has con- 
tracted enough to exclude any galaxies at the trailing edge. 
We found that using a fixed separation of 1.5° in declina- 
tion, and adjusting the tile positions in right ascension only, 
provided sufficient flexibility to achieve uniform high com- 
pleteness without a large increase in the total number of 
flelds and without leaving gaps in the sky coverage. 

In practice, it is found that the above prescription re- 
quires a further modification to account for the position of 
each object within the field. This additional constraint arises 
because of the physical restrictions on the positioning of in- 
dividual fibres, both in terms of their deviation from the 
radial angle and their extension from the parked position at 
the edge of the field. We apply this constraint by dividing 
each field into 36 sub-fields and restricting the number of 
targets that can be assigned to each sub-field to 16. This is 
larger than the number of fibres whose park positions fall 
within the arc of the sector, but allows for the fact that the 
area of the field that can be reached by each fibre is increas- 
ing with the fibre extension. Without this extra constraint 
the algorithm tends to place large clusters in the overlapping 
areas of neighbouring fields where, although there are more 
available fibres because of the overlap, it rapidly becomes 
impossible to use these fibres because of the high density of 
targets close to the edge of each field. Limiting the number of 
targets within each sector effectively removes this problem, 
and so increases the uniformity of the survey completeness. 

The adaptive tiling algorithm also needed to cope with 
the requirement that the galaxy redshift survey be merged 
with the concurrent survey of QSO candidates (the 2dF QSO 
Redshift Survey: Boyle et al. 2000; Groom et al. 2001). This 
results in a higher surface density of targets in the region of 
overlap, for which we compensate by reducing the separation 
in declination of the tiling strips in the QSO survey regions 
to 75% of the original value (i.e. to 1.125°). As the QSO sur- 
vey occupies the central declination strip of our SGP survey 
region this results in a 3-4-3 arrangement of tiling strips over 
the three rows of UKST fields used in the SGP survey. A 
similar consideration applied to the NGP survey region gives 
a 3-4 arrangement if we consider the full areas of the two 
rows of UKST fields used. 

With these additional modifications, our tiling algo- 
rithm was able to achieve an overall sampling rate of 99.8%, 
dropping to 93% aft er the assignment of fibres to targets has 
been made (see §5.3). On average, fewer than 5% of the fibres 
in each field were unused. For comparison, the mean sam- 
phng rate that would be achieved for randomly distributed 
fields is 68%, with an rms variation of 15%. 



5 FIBRE ASSIGNMENT 

The tiling procedure described above fixes the location of 
the 2dF target fields on the sky and provides a first pass 
priority scheme (based on the random field assignments) for 
use in assigning targets to fibres. However, the mechanical 
constraints of 2dF imply that we cannot usually allocate 
fibres to all the targets assigned to a particular field, and so 
we consider all targets that lie within each field boundary. 
Our tiling scheme implies that many of the survey targets 
are found on more than one field, so we adopt a priority 
scheme as follows: targets which are unique to the field in 



question are assigned the highest priority, then targets which 
are assigned to the field, but which can also be reached from 
neighbouring fields, then targets which fall within the field, 
but which are assigned to a different, overlapping field. The 
priorities of the QSO targets are increased one step to ensure 
that we do not imprint the strong clustering pattern of the 
galaxy distribution on the weak clustering expected from 
the QSO sample. 



5.1 Single-field assignments 

The allocation procedure for a single field can be divided 
into two distinct steps: an initial allocation pass and a fibre 
swapping pass. These two steps are repeated for each range 
of decreasing priorities for targets that have been assigned to 
this field. An attempt is then made to reduce the complexity 
of the configuration by identifying pairs of allocated fibres 
which are crossed, and which can safely be exchanged. The 
allocation and uncrossing steps are then repeated to allow 
consideration of the lower priority targets which were not 
assigned to this field. The details of these steps are as follows: 

(1) Initial allocation pass. For each target, t, of a given 
priority we determine the set, J-t, of all fibres that could be 
allocated to that target in the absence of any other alloca- 
tions. We then determine the set, %, of all targets which fall 
within the sector defined by the target and the two most ex- 
treme fibres in J-'t. We then use the difference N (Tt) — N (J-'t) 
between the number of targets in Tt and the number of fibres 
in J^t to determine which targets are hardest to access and 
hence should be given the highest priority: targets within 
the sector defined by the target with the largest value of 
N{Tt) — N[Tt) are allocated first. This procedure is then 
repeated for targets of successively lower priorities. 

(2) Fibre- swapping pass. For each remaining unallo- 
cated target, u, we now consider each fibre in 7^. For each 
fibre in this set that could still be allocated to u given the 
current configuration we make a tentative re-allocation of 
this fibre from its initial assignment, i, to target u and check 
to see if any unallocated fibres can be assigned to i. If no 
unallocated fibres can be assigned we repeat this process 
recursively until one of the following conditions have been 
satisfied: either (i) a previously unallocated fibre is allocated 
(implying that u has been allocated and that all previously 
allocated targets remain allocated); or (ii) the search exceeds 
a depth of 10 iterations. In the latter case, if at any point in 
the search a fibre has been moved from a low-priority tar- 
get such that a target of higher priority has been allocated 
then the search is unpacked to this point; if no such trade- 
off was found then the search is unpacked to the original 
configuration. 

(3) Fibre-uncrossmg pass. Each pair of fibres in the con- 
figuration is tested to see if the fibres cross. For each crossed 
pair an attempt is made to swap the allocations of fibres to 
targets. This process is iterated until no further swaps can 
be made. The net effect of this procedure is that the fi- 
nal configuration supplied to the positioner is simplified so 
that the transition to the following configuration will require 
fewer fibres to be moved twice. This simplification effectively 
reduces the time required for each re-configuration, and is 
achieved without constraint on the allocation. 
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Figure 4. The distribution of tlie 7% of objects in the sample that are not allocated to fibres (NGP strip at top, SGP strip at bottom). 



5.2 Field-overlap assignments 

While the above procedure provides an optimised method 
of allocating as many fibres as possible to a given individual 
field, it does not make optimal use of the flexibility provided 
by the overlapping fields. We therefore ran a second pass of 
the configuration algorithm which considered each pair of 
overlapping fields as an 800 fibre problem in which all objects 
found within the overlap area are matched up, so that an 
object which is allocated in one field can be allocated to 
the second field to allow acquisition of unallocated targets 
in the first field. Compared to the results of treating the 
two overlapping fields as separate configuration problems, 
this procedure proves more effective at reducing the overall 
incompleteness due to the instrumental constraints. Since 
the tiling algorithm tends to place local over-densities close 
to the edges of tiles, this procedure also serves to homogenise 
the distribution of incompleteness as a function of position 
within a field. 

A significant fraction of this incompleteness arises due 
to close pairs of objects which lie within the unique regions 
of individual fields. The separation at which this occurs is 
a strong function of the relative orientation of the two fibre 
buttons (see Lewis et al. 2001). The upper limit on the value 
of this separation is 11mm in the focal plane (corresponding 
to 2.2'), while the mean separation of close pairs of objects 
for which one target cannot be assigned is ~30". 



5.3 Fibre assignment completeness 

The results of this procedure are that we are able to al- 
locate fibres to 93% of the source catalogue objects. The 
distribution of the unallocated objects on the sky are shown 



in Figure The most prominent features visible in these 
distributions are occasional localised clusters of unallocated 
objects. These are due to over-dense regions where the ge- 
ometrical packing constraints imposed by the fibre button 
dimensions mean that it has not been possible to assign ev- 
ery fibre to a target, even though there are enough fibres 
available as a result of the tiling algorithm. This effect is en- 
hanced by the relative increase in the number of close pairs 
in strongly clustered regions. 

The distribution of these unallocated objects is also 
shown as a function of position within the 2dF field in Fig- 
ure ^. This plot shows a slight overall gradient in the R.A. 
direction, which is due to the order in which the field over- 
laps are processed (see 



5.2). This apparent gradient is mis- 



leading, since the missed objects are plotted according to 
the field plate coordinates on the plate to which they were 
assigned, whereas the majority of objects which contribute 
to this effect are actually missed on more than one 2dF field. 
The next most prominent feature of Figure ^ is the imprint 
of the four fibres which are used for field acquisition and 
guiding. These fibres must be allocated with a higher prior- 
ity than the targets if the field is to be observed successfully. 
They prevent the subsequent allocation of target objects ly- 
ing along their length, and so produce a slight increase in 
the numbers of unallocated objects along the four cardinal 
axes of the field, in the regions of the plate where they can 
be placed. 

It should be noted that these figures do not represent 
the final incompleteness in the survey, since individual target 
allocations are adjusted immediately prior to observation to 
account for the actual number of fibres available to 2dF on 
any given night (typically, about 1% of the fibres are broken 
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5. The distribution of the unallocated objects within the 
2dF fields. 



at any given time, though they are continually replaced). 
The overall incompleteness at this stage, before any obser- 
vations have been made, is 7%. The level of this incomplete- 
ness is such that the pattern is dominated by residuals due to 
clustering within individual two-degree fields, implying that 
an attempt to correct for this efTect would effectively imprint 
the observed pattern onto the data. We therefore conclude 
that we have achieved our goal of reducing the mean in- 
completeness to an acceptably low level, comparable to the 
level of incompleteness due to the individual uncertainties in 
galaxy magnitudes and the incompleteness due to not mea- 
suring redshifts because of inadequate signal-to-noise. We 
emphasise that neither the tiling nor the fibre allocation de- 
pend on the photometric or spectroscopic properties of the 
objects, so that we can use the source catalogue to accu- 
rately determine the sampling of the survey, as discussed in 



5.4 Visual inspection 

Postage stamp images of each target galaxy and each can- 
didate fiducial star were generated from the Digitized Sky 
Survey[] (DSS; Lasker et al. 1998). These DSS images were 
used to examine the target galaxies for merged images and 
to remove unsuitable fiducial star candidates. 

The APM image classification parameter k (see Mad- 
dox et al. 1990b) was used to find target galaxies which 
might possibly be merged images. Objects with A:>1.15 in 
the SGP and fe>1.2 in the NGP were analysed by a sim- 
ple routine which attempts to relocate the fibre onto the 



^ The Digitized Sky Survey was produced at the Space Telescope 
Science Institute, and is based on photographic data obtained 
using the Oschin Schmidt Telescope on Palomar Mountain and 
the UK Schmidt Telescope. 



local maximum of surface brightness closest to the nomi- 
nal target position. This is achieved by starting from the 
central pixel of the DSS image and allowing the position 
to move in a direction of increasing intensity until a local 
maximum is found. The position corresponding to this lo- 
cal maximum is then inserted into the 2dF configuration file 
prior to observation. All objects which are tested in this way 
are flagged as merged images by changing the second char- 
acter of the object's OBSNAME parameter from G to M, even 
if the final position of the fibre is unchanged. The survey 
database records both the original source catalogue position 
(database parameters RA and DEC; see §9.2)and the actual 
observed position (database parameters OBSRA and OBSDEC) ; 
the parameter MATCH_DR gives the offset (in arcsec) between 
these two positions, and may be non-zero for objects flagged 
as mergers. 

The DSS images of candidate fiducial stars are in- 
spected visually to allow removal of the following types of 
object, which would compromise field acquisition: (i) stars 
which are merged with another object; (ii) stars which are 
close to other stars of comparable brightness; (iii) bright 
stars which are strongly saturated on the sky survey plates; 
(iv) spurious objects that are fragments of the diffraction 
spikes or halos of bright stars, or bright regions within 
large galaxies; (v) asteroids and portions of satellite trails; 
(vi) spurious objects due to noise. 



5.5 Final adjustments 

The configuration procedure described above determines the 
optimal assignment of fibres to targets over the whole sur- 
vey. This procedure assumes a fixed set of parameters for 
the transformation of sky-coordinates to plate-coordinates, 
whereas in practice a slightly different transform is deter- 
mined each time the instrument is mounted on the telescope. 
The above procedure also assumes a full complement of fi- 
bres, and does not allow for the small but significant attri- 
tion rate as flbres are disabled due to optical or mechanical 
failures. Fibres which are disabled in this way are repaired 
on a regular basis, but operational constraints require that 
disabled fibres are replaced in batches, and so there is gen- 
erally some delay before a particular fibre is replaced. 

We account for both the fibre attrition and the effects 
of changes to the coordinate transform by re-examining each 
configuration the day before that field is to be observed. All 
broken or inoperative fibres which had been allocated are 
deallocated and the targets to which they were assigned are 
flagged. An attempt is then made to recover each flagged 
target using the current list of available fibres, according to 
the same algorithm used for the original allocation. Again, 
low-priority targets may be lost at this stage in favour of 
high-priority targets. In practice, we usually find that the 
majority of the high-priority targets which were flagged can 
either be recovered or replaced. 

A similar check is then made to allow for a further 
change to the coordinate transform due to the hour angle at 
which the field is to be observed. This correction is a com- 
bination of the effects of atmospheric refraction and fiexure 
and misalignment of the telescope itself. Each configuration 
is checked for hour angles of ±4 hours, and adjustments are 
made to ensure that no fibre conflicts will occur within this 
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range. Again, we usually find that all such conflicts can be 
recovered without loss of target allocations. 

After this correction to the configuration, a number of 
unallocated fibres (at least 10 for each spectrograph) are al- 
located to blank sky positions. This is done initially using an 
azimuthally symmetrical grid of positions around the field, 
but if insufficient fibres can be allocated to this grid, more 
are allocated to other blank sky regions manually. The posi- 
tion of each allocated sky position is then checked to ensure 
that it is genuinely blank by examining the surrounding area 
using the Digitized Sky Survey. Any sky fibres likely to be 
contaminated by objects visible on the DSS are reallocated 
to other blank sky regions. 



6 SPECTROSCOPIC DATA 
6.1 Observational procedure 

A target field is acquired using the four guide fibres, each of 
which consists of 6 fibres in a hexagonal pattern surround- 
ing a central fibre. The individual fibres are 95^m (1.4") 
in diameter and have centre-to-centre separations of 1.8". 
These guide fibres are positioned on 14<V<15 stars with 
positions measured from the same UKST plates as the tar- 
get galaxies. As the blue sky survey plates are often 20-25 
years old, we use the more recent red sky survey plates to 
check for any proper motion, and we also apply a colour cut 
to reduce the number of nearby stars. On some fields, where 
red plates are not available, we have used fiducial stars from 
the USNO Al.O astrometric catalogue (Monet et al. 1996). 

The guide fibres are viewed by the AAT's standard ac- 
quisition and guiding TV system. The position for optimum 
acquisition of the fiducial stars, and subsequent guiding cor- 
rections, are determined by visual inspection of the locations 
of the stars in the guide fibres on the TV image. Acquisition 
of a new field typically requires less than 5 minutes. 

For each field we first take a multi-fibre fiat-field ex- 
posure using the quartz lamp in the calibration unit. This 
fiat-field is used to trace the positions of the fibres on the 
image, to fit the spatial profile of each fibre as a function 
of wavelength, and to apply a 1-dimensional pixel-to-pixel 
fiat-field correction to the extracted spectra. We next take a 
wavelength calibration exposure of helium and copper-argon 
arc lamps. We then typically take three 1100s exposures of 
the target objects. However the exposure time is varied to 
suit the conditions: for galaxy-only fields in good conditions, 
three 900s exposures are sufficient; in poor conditions, espe- 
cially for fields including targets from the QSO survey, a 
longer total integration time is used. With the CCD read- 
out time of 60s, the whole series of exposures typically takes 
just over an hour, so that the observations of one field are 
finished just as the configuration for the next field is com- 
pleted. 



6.2 Data reduction 

The data are reduced using the 2dF data reduction pipeline 
software, 2df dr, a fu ll description of which is give n by Bailey 
et al. (2001; see also ^ittp: //www. aao.gov.au/2dl ). The main 
steps in the process are as follows: 

1. Basic image processing. Each image undergoes the 



same initial processing, which consists of fiagging bad pixels 
(both saturated pixels and those specified in the bad-pixel 
mask for each CCD). The bias level is computed as the me- 
dian of the CCD overscan region and subtracted from the 
image. The image is then trimmed to remove the overscan 
region. Finally, a variance array is computed based on the 
pixel data values and the known properties of the CCDs. 
The variances are propagated through all subsequent reduc- 
tion steps. 

2. Mapping the spectra. The next step is to map the 
locations and shapes of the spectra on the detector. The 
centroids of the fibre spectra along the slit are determined 
using the fibre flat fleld image, and the fibre identifications 
checked visually. This is necessary because the positioning 
of the slit block is not precisely reproducible, so that the 
locations of the fibres on the CCD can change by a few 
pixels, and occasionally the first or last fibre may fall off the 
edge of the detector. Visual checking ensures that spectra are 
correctly identified with fibres and thus with the objects that 
were observed. Given this starting location, 2df dr finds the 
centroids of each fibre spectrum as a function of wavelength 
and fits a model based on the known optical properties of 
the spectrograph. For each subsequent frame, this model is 
adjusted (to account for flexure) by fitting an overall shift 
and rotation to the positions of the spectra on the detector. 

3. Subtracting the background. The background of scat- 
tered light is fitted with an empirically-determined model 
using the unilluminated portions of the detector. Fibres are 
so closely packed on the detector that the light level be- 
tween transmitting fibres does not drop to the background 
level. Hence the only unilluminated areas are at the edges 
of the detector and at the positions of dead (broken, non- 
transmitting) fibres. A small number of fibres are broken at 
any one time and are continually being replaced. Occasion- 
ally the number of broken fibres becomes sufficiently small 
that it is not possible to reliably fit the background, and this 
step must be omitted. 

4. Fitting the spatial profiles. The close packing of the 
fibres on the detector means that their spatial profiles over- 
lap, producing cross-talk between neighbouring spectra. In 
order to correct for this, it is necessary to know the spatial 
profile of each fibre spectrum as a function of wavelength. 
These profiles are determined from the fibre fiat field image 
by performing simultaneous fits of 200 Gaussians to the fibre 
profiles at each of 20 wavelength ranges. These fits are then 
interpolated in the wavelength direction with cubic splines 
to give the spatial profile for each fibre at every wavelength 
on the detector. 

5. Spectrum extraction. Given the location and shape of 
each fibre's spatial profile, optimal extraction consists of a 
weighted fit of the profiles to recover the amplitude of the 
spectrum at each wavelength. Because the spectra overlap, 
a simultaneous fit of the profile amplitudes of each fibre 
spectrum and its nearest neighbour on either side is carried 
out. The fit is performed using least-squares with variance 
weighting. 

6. Flat-fieldmg. The extracted spectra are then 'flat- 
fielded' by dividing by the pixel-to-pixel variations in the 
extracted spectra from the fibre flat fleld image. This one- 
dimensional flat-fielding is used because there is presently no 
means of generating a full two-dimensional fiat-field image. 

7. Wavelength calibration and linearisation. A predicted 
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wavelength for each pixel, based on the nominal central 
wavelength and the optical model for the spectrograph, is 
first calculated. Lines in the helium/argon arc spectra are 
automatically identified by a peak-finding algorithm, then 
matched to a list of known lines, discarding both unidenti- 
fied peaks and unmatched lines, as as well as known blends. 
The relation between the measured line positions and their 
true wavelengths is fit by a third-order polynomial. This fit 
iterated up to four times, with the most discrepant line ex- 
cluded at each iteration. The final fits usually include 21-22 
lines over the 4400A spectral range, and have typical rms 
residuals of O.SA (0.07 pixels). Once the wavelength calibra- 
tion is determined, the spectrum is re-binned onto a linear 
wavelength scale using quadratic interpolation. 

8. Fibre throughput calibration. The spectrum of the sky 
background is measured with at least 8 fibres (and usually 
more) on each of the two spectrographs. The spectra from 
the sky fibres are normalised by their mean fiuxes and a 
median sky spectrum is calculated. The relative through- 
puts of the fibres are then derived from the relative fluxes 
in the strong sky lines as follows. All object and sky fibres 
and the median sky are continuum-subtracted using a con- 
tinuum derived by median smoothing with a 201-pixel box. 
This removes the continuum leaving only the strong lines in 
the residual spectrum. A robust least-squares fit to the con- 
stant of proportionality between the counts in each pixel of 
the residual object -(-sky spectrum and the residual median 
sky spectrum then yields the relative throughput for each 
fibre (the sky lines in common to both spectra contribute 
to the fit, but strong emission lines found only in the object 
spectrum do not). The individual object and sky spectra are 
then normalised by dividing by their relative throughputs. 

9. Sky subtraction. The median sky spectrum is re- 
calculated from the individual normalised sky spectra, and 
subtracted from each normalised object and sky spectrum. 
The sky-subtraction precision, measured as the ratio of the 
total flux in the sky flbres before and after sky-subtraction, 
is typically 2-3% and rarely worse than 6%. Precise sky- 
subtraction is dependent on effective scattered light subtrac- 
tion, wavelength calibration and throughput calibration. 

10. Combining spectra. The fully-reduced spectra from 
each of the multiple exposures on the target objects (typ- 
ically three) are optimally combined in a flnal step. The 
combination algorithm accounts for overall flux variations 
between exposures (due to different integration times or ob- 
serving conditions) and rejects cosmic ray events. The rel- 
ative weighting of the exposures is computed from the to- 
tal fluxes in the brightest 5% of the spectra (after they are 
median-smoothed over a 201-pixel box to eliminate cosmic 
rays and normalised to the first exposure). The maximum 
fiux weight is unity, and the weights are applied to the in- 
dividual exposures before they compared for cosmic rays. 
The cosmic ray rejection algorithm is similar to that of the 
IRAF crreject package. It fiags likely cosmic ray events, 
which are taken to be those pixels that are more than 5(t 
deviations from the median (over the set of flux-weighted 
exposures) together with some neighbouring pixels. The fl- 
nal pixel value in the combined spectrum is computed as the 
vaxiance-weighted mean over the flux-weighted exposures of 
the unflagged pixels. 

The flnal result of the reduction is a pair of data flies 
(one for each CCD), both of which contain a multi-spectrum 



image (200 flbres x 1024 spectral pixels), the associated 
variance array (which is correctly propagated throughout 
the reduction process), and a copy of the median sky spec- 
trum used in the background subtraction. The data flies also 
contain the identiflcations of the spectra obtained from the 
flbre conflguration file and detailed information about the 
instrument and observation. 

The spectra are not flux-calibrated. Flux calibration of 
flbre spectra is very difficult to achieve for a number of rea- 
sons. Chief among these are: (i) astrometric/positioning er- 
rors, which, combined with the small flbre aperture, prevent 
reproducible sampling of each galaxy's light distribution; 
(ii) chromatic variations in distortion (see §^ and resid- 
ual errors in the atmospheric dispersion correction, which 
produce variations in the flux calibration dependent on the 
attitude of the telescope and the position of the object in the 
fleld. These effects can produce signiflcant variations in the 
continuum slope of the spectra, and are discussed in quanti- 
tative detail in Madgwick et al. (2001b) and Madgwick et al. 
(2001c, in preparation). 

Various problems are exhibited by some of the spectra: 
(i) Poor sky-subtraction. A small but signiflcant fraction of 
flelds suffered sky-subtraction errors of more than 5%. The 
strong oxygen sky lines at 5577A, 6300A, 6363A and the 
OH bands to the red of 6OOOA leave signiflcant residuals in 
the spectra, (ii) Fringing. Damaged fibres sometimes pro- 
duce a fringing effect that results in a strong oscillation in 
the throughput with wavelength, (iii) Halation. Sometimes 
halation due to condensation on the field fiattener lens just 
in front of the CCD results in a high level of scattered light 
which fills in absorption features; when present, this effect 
was worse in CCD camera #2. (iv) ADC error. A soft- 
ware error meant that the atmospheric dispersion corrector 
(ADC) was incorrectly positioned for observations prior to 
31 August 1999; some of the spectra obtained before this 
date suffer from significant atmospheric dispersion losses. 



7 REDSHIFTS 

7.1 Redshift estimation 

The main parameter to be determined from the spectra was 
the redshift of each object. A sophisticated and highly-tuned 
redshift code was developed in order to achieve a high level 
of precision, reliability and automation in the measurement 
of redshifts from the 2dFGRS spectral data. 

The redshift code (version 990505) uses two quasi- 
independent automated methods: 'absorption' redshifts are 
obtained by cross-correlation with template spectra (after 
clipping emission lines) and 'emission' redshifts are obtained 
by finding and fitting emission lines. These automated red- 
shift estimates are followed by a visual check and (occasion- 
ally, where necessary) a 'manual' redshift obtained by fitting 
identified spectral features. 

Before these redshift estimates are made, however, the 
spectra are pre-processed in a number of ways. Residual fea- 
tures from the strong atmospheric emission lines at 5577A, 
5893 A, 6300 A and 7244 A are masked by interpolating over 
a small wavelength range (20-30A) centred on the sky line. 
The atmospheric absorption bands around 6870A (B-band) 
and 7600A (A-band), and the fibre absorption band around 
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Figure 6. The eight template spectra against which the 2dFGRS 
galaxy spectra were cross-correlated. 



7190A, are removed by dividing by an approximate band 
correction. This correction is obtained by fitting a low-order 
polynomial to the mean of all the spectra in the reduced im- 
age, dividing the mean spectrum by this smooth fit, finding 
the region about each band centre where this ratio is less 
than unity, and setting the band correction to the value of 
this ratio in these regions and to unity elsewhere. The spec- 
tra, which are not flux-calibrated, are finally multiplied by a 
simple quadratic approximation to the mean flux calibration 
correction in order to give appropriate weighting across the 
whole spectral range. 

An 'absorption' redshift is obtained by cross-correlating 
the galaxy spectrum against a suite of 8 template spec- 
tra following the general method of Tonry & Davis (1979). 
The templates that are used cover a broad range of spec- 
tral types, and include 5 galaxies and 3 stars. The galax- 
ies (and their morphological types) are: 1. NGC3379 (E), 
2. NGC4889 (cD), 3. NGC5248 (Sbc), 4. NGC 2276 (Sc) 
and 5. NGC 4485 (Sm/lm); the stars (and their spectral 
types) are: 6. HD 116608 (AlV), 7. HD 23524 (KOV) and 
8. BD 05 1668 (M5V). The galaxy spectra come from spec- 
trophotometric atlas of Kennicutt (1992); the stellar spectra 
come from the library of Jacoby et al. (1984). The full set 
of template spectra are shown in Figure |^. 

The galaxy and template spectra are prepared for 
cross-correlation using the following steps: (1) continuum- 
subtraction using a sixth-order polynomial fit to the con- 
tinuum; (2) removal of strong emission lines by patching 



out all points more than 5 times the rms variation above 
the mean; (3) re-binning to a logarithmic wavelength scale 
with 2048 pixels; (4) apodising (tapering the ends of the 
spectrum to zero) by multiplying the flrst and last 5% of 
the spectrum with a cosine bell; (5) Fourier transformation; 
and, flnally, (6) multiplication by an exponential filter of the 
form 6"'°'^'°'' — e"*^/*! ^ where k is wavenumber in inverse pix- 
els, fch=300 pixel"^ and fc; = 15pixel~^, in order to reduce the 
effects of both the residual continuum at low wavenumber 
and the noise at high wavenumber. 

The cross-correlation function is then computed as the 
inverse Fourier transform of the complex product of the fil- 
ter Fourier transforms of the processed galaxy and template 
spectra. The highest peak in cross-correlation function is 
fitted with a quadratic in order to obtain the position and 
height of the peak. The significance of the peak is measured 
in terms of the ratio, R, of the height of the peak to the noise 
in the cross-correlation function, which is estimated from 
its anti-symmetric part. The absorption redshift is taken to 
be the cross-correlation redshift obtained with the template 
giving the highest value of R. An estimate of the quality 
of the absorption redshift, Qa, is based on the value of R, 
with Qa=l, 2,3,4 for i?>3. 5,4.0,4.5,5.0 and Qa=0 otherwise 
(with the additional requirement that Qa=3 and Qa=4 also 
require, respectively, at least 4 and 6 of the 8 templates give 
the same redshift to within 600kms~^). 

An 'emission' redshift is obtained by fitting Gaussians 
to significant spectral features and searching for a multi-line 
match. The galaxy spectrum is first continuum-subtracted 
using a sixth-order polynomial fit to the continuum and 
lightly smoothed with a Gaussian kernel having a disper- 
sion of 0.8 pixels. Peaks in the smoothed spectrum above a 
threshold (set at 3.3 times the robustly-estimated rms vari- 
ation) are flagged as candidate emission lines. For each of 
these, in descending order of strength, a Gaussian is then fit- 
ted to the unsmoothed data, using the peak in the smoothed 
spectrum as the initial guess for the line centre. The Gaus- 
sian fits to the stronger lines are subtracted in turn before 
the weaker lines are fitted (to minimize blending effects in 
the Ha/NII pair). Then lines with fitted FWHM between 0.7 
and 7 pixels, and total 'significance' of the whole line above 
3.5(T are marked as 'good'; this rejects undipped cosmic rays 
and low-significance features. 

The 'good' lines are then sorted by strength, and the 
three strongest tested to see if at least two can be iden- 
tified as common emission lines ([OII]3727A, H/3486lA, 
[OIII]5007A, Ha6563A or [Nil] 6584A) at a single redshift 
(the two out of three criterion is used to allow for one spuri- 
ous feature, such as a residual sky line or cosmic ray). If so, 
then all the other lines in this set (plus [OIII] 4959A) that 
match this redshift to better than 600kms~^ are found and 
the mean redshift from these matching lines is adopted as 
the emission redshift. If not, then up to two possible single- 
line redshifts are kept for comparison with the absorption 
redshift, assuming the single line is either [Oil] or Hq in the 
range 0<2:<0.4, as appropriate. An estimate of the qual- 
ity of the emission redshift, Qe, is based on the number and 
strength of the lines from which it is determined, with Qe=0 
if there are no lines, Qe=l for one weak line, Qe=2 for any 
two lines or one strong line, and Qe=4 for three or more 
lines. 

The penultimate step is the choice of the best auto- 
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mated redshift estimate, given the absorption and emission 
redshifts and their quality parameters. The best redshift is 
taken to be whichever of the absorption and emission red- 
shifts has the higher quality parameter (with the absorption 
redshift preferred if Qa=Qe)- The best estimate of the red- 
shift quality is Q;, =max(Qa,Qe), with two exceptions: (i) if 
the difference between the absorption and emission redshifts 
is less than 600kms~^ then Qb = max(Qa,Qe,3); and (ii) if 
Qa>2 and Qe>2 and the difference between the absorption 
and emission redshifts is greater than 600kms~^, then this 
discrepancy is flagged and we set Q6 = l. 

The final step in measuring the redshift is a visual check 
of the best automatic redshift. The redshift code displays for 
the user both the cross-correlation function and the galaxy 
spectrum with all the common spectral features superposed 
at the best automatic redshift. The spectrum displayed is 
the version actually used for the redshift estimate, with bad 
pixels and the strongest night sky lines patched out, and 
with an approximate correction applied for the instrumen- 
tal response. To assist in identifying spurious features the 
code also displays the mean sky spectrum (to show the posi- 
tions of the sky features), the atmospheric absorption bands 
and the variance array associated with the spectrum. The 
user is given the emission and/or absorption line redshifts, 
the best-estimate redshift, whether this is an emission or 
absorption redshift (or both), and whether (if both are ob- 
tained) the emission and absorption redshifts agree or not. 
If neither of the automated estimates are correct, the user 
can manually estimate a redshift by identifying a particular 
spectral feature and fitting Gaussians at the positions of all 
the common features. These fits can be to emission or ab- 
sorption features and are rejected if there is no detectable 
feature near the nominal position. The manual redshift is 
taken to be the mean of the redshifts estimated from each 
of the well-fitted features. Given all this information, the 
observer can visually inspect the spectrum, try alternative 
redshifts (absorption, emission and manual) and determine 
the best redshift estimate. 

Once the final redshift estimate has been decided, the 
user assigns a quality, Q, to this redshift on a five-point 
scale, with the nominal interpretation (see below for fur- 
ther discussion); Q=l means no redshift could be estimated; 
Q=2 means a possible, but doubtful, redshift estimate; Q=3 
means a 'probable' redshift (notionally 90% confidence); 
Q=4 means a reliable redshift (notionally 99% confidence); 
Q=5 means a reliable redshift and a high-quality spectrum. 
Note that this quality parameter is determined entirely by 
the subjective judgement of the user, and is independent of 
the automatic quality parameter Qi,. Quality classes 1 and 
2 are considered failures, so the redshift completeness is the 
number of Q=3,4,5 redshifts divided by the total number of 
galaxies in the field. The standard redshift sample comprises 
objects with Q>3, but (for applications where redshift relia- 
bility is more important than completeness) one may prefer 
to use the set of objects with Q>4. 

For the objects ending up with 'acceptable' (Q=3,4,5) 
redshifts, the best automated redshift is the same as the best 
visual redshift about 93% of the time; about 5% of the time 
either the absorption or the emission redshift agrees with the 
best visual redshift but the code has preferred the other one; 
and about 2% of the time neither of the estimates matches 



the best visual redshift, and a manual redshift estimate is 
required. 

While there is some element of subjectivity in decid- 
ing whether a redshift estimate is 'acceptable' or not, this 
visual assessment is found to improve the reliability of the 
final sample, since the observer can assess a number of fac- 
tors that are hard to fully automate, such as the overall 
shape of the spectrum, line ratios, the general quality of sky 
subtraction, and the proximity of features to sky lines or ab- 
sorption bands. For 15,000 spectra, two or more users have 
measured redshifts independently using the same redshift 
code on the same spectra; the pair-wise 'blunder' rate (see 
below) is 0.4%, which is several times smaller than that for 
different observations of the same object. The rms difference 
in redshift completeness (fraction of objects per field with 
Q>3) between two users for the same data is less than half 
of that between the same observer measuring redshifts for 
many different fields. The human variation is therefore sub- 
stantially smaller than the unavoidable variation between 
fields due to weather and other effects, and will to first or- 
der be corrected by the completeness factors. 

7.2 Quality of spectra 

A quantitative overall measure of spectral quality is the me- 
dian signal-to- noise ratio (S/N) per pixel computed over 
all pixels in the wavelength range 4000-7500A with fluxes 
greater than zero. This is computed directly from the spec- 
trum and it associated variance array, and can be compared 
to the quality class Q assigned to the redshift identification. 
The distribution of S/N over the 5 quality classes for the 
best spectrum of each object (i.e. the spectrum on which 
the adopted redshift is based) is shown in Figure ^. The 
fraction of objects in each quality class and the median S /N 
for each quality class are given in the figure legend. Objects 
with reliable redshifts (Q=3,4,5) make up 91.8% of the sam- 
ple and have a median S/N of 13.1pixel~^ (6.3 A~^). 

Example 2dFGRS spectra for quality classes Q=3, Q=4 
and Q=5 are shown in Figure ^; for each quality class, one 
object with an absorption redshift and one object with an 
emission redshift are shown. In order to be representative, 
the spectra are chosen to have a S/N per pixel close to the 
median of their quality class. Residuals from the strong night 
sky lines at 5577A and 6300A have been interpolated over. 

7.3 Repeat observations 

Amongst the first 150 000 galaxies in the survey with red- 
shifts, there are 8480 repeat redshift measurements; i.e. a 
second, or even a third, reliable (Q=3,4,5) redshift measure- 
ment for an object. These repeat measurements are either 
objects that lie in the overlaps between fields (7444 cases) 
or are in fields which were observed more than once, usually 
because the first observation was of poor quality and low 
completeness (1036 cases). For these objects we can com- 
pare their best redshift measurement with the repeat mea- 
surements to estimate both the rate of incorrect redshift 
identifications in the survey and the uncertainties on the 
correct redshift identifications. 

The top panel of Figure ^ shows the distribution of 
the redshift differences (w.r.t. the best redshift measure- 
ment for the object) for all the repeat observations. The rms 
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Figure 7. The distribution of the median S/N per pixel for ob- 
jects in each of the 5 quality classes. 



redshift diflerence (robustly estimated here and throughout 
as the half-width of the central 68% of the distribution) is 
120kiTis~^, implying an overall single- measurement rms un- 
certainty of 85kms~^. 

The lower panels of the figure show the trends in the 
redshift uncertainty as a function of redshift, magnitude, 
S/N and quality class Q. As expected, the uncertainties 
increase at higher redshifts (from 95kms~^ for z<0.05 to 
150kms~^ at 2>0.15), fainter magnitudes (from lOOkms"^ 
for bT<18.5 to 140kms"^ for b,>19.3) and lower S/N 
(from 95kms^^ for S/N>25 to 145kms~^ for S/N<5). 
The strongest trends are with redshift quality class and the 
method of estimating the redshift. Table ^gives the rms pre- 
cision of single measurements as function of redshift quality 
class or measurement method (ABS — cross-correlation of 
absorption features; EMI = automatic fit to emission lines; 
MAN = manual fit to features). The fact that manual red- 
shift estimation has the largest uncertainties mainly reflects 
the fact that it is only employed when the automatic meth- 
ods have failed, usually in cases of low S/N. 

We divide the repeat measurements into two categories: 
correct identiflcations, where the redshift difference is less 
than 600kms~^, and 'blunders', where it is greater than 
600kms~^. The division at 600kms~^ corresponds to 5 
times the rms error in the redshift differences. It should be 
noted that these blunders are not necessarily due to incor- 
rect redshift measurements for one (or both) of the observed 
spectra. They may also be due to two observations of a close 
pair of objects with slight offsets in the fibre position, result- 
ing in a different member of the pair dominating the spectral 
ffux in each observation. 

There are 263 blunders, so that the pair- wise blunder 
rate amongst the Q=3,4,5 repeat measurements is 3.1%. 
This irnplies a single-measurement blunder rate of just 1.6%. 
Figure hu shows plots of the redshift differences for the blun- 
ders with respect to redshift, magnitude, S/N and quality 




""^^ : TGN124Z049 bj= 18.32 

- z = 0.0557 Q = 4 EMI 
400 - 




; TGS153Z170 bj=17.77 
3000 - z = o.0420 Q = 5 EMI 



2000 - 




4000 5000 6000 7000 

Wavelength (A) 



Figure 8. Example 2dFGRS spectra, showing objects with qual- 
ity classes Q=3, Q=4 and Q=5, and both absorption (ABS) and 
emission (EMI) redshifts. The most common spectral features are 
marked at the wavelengths corresponding to the measured red- 
shift of each object. 
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Figure 9. The distribution of redshift differences between repeat 
measurements with Q>3 is shown in the top panel, along with the 
Gaussian having the same median and rms. The dependence of 
the redshift differences on redshift, apparent magnitude, signal- 
to-noise ratio and quality class for the best observation, is shown 
in the lower panels (note that the quality classes have small ran- 
dom offsets added to make the distribution of redshift differences 
clear). The lines in the lower panels are the running median and 
the ±2(T ranges. 



Table 1. Summary statistics by quality class and method. 



Quality 


Fraction 


RMS error 


Blunder 


class 


of sample 


( km s"-"-) 


rate 


Q=l 


2.5% 






Q=2 


4.8% 


143 


69.5% 


Q=3 


11.9% 


123 


10.2% 


Q=4 


57.3% 


89 


0.9% 


Q=5 


23.5% 


64 


0.2% 


Redshift 


Fraction 


RMS error 


Blunder 


method 


of sample 


( km s~^ ) 


rate 


Q=l,2 


7.3% 






ABS 


72.0% 


87 


0.7% 


EMI 


18.3% 


61 


0.9% 


MAN 


2.4% 


159 


9.3% 



class; it also shows the pair-wise fraction of blunders as a 
function of each of these quantities. 

There is no increase in the blunder rate with redshift, 
but a significant increase at fainter magnitudes, lower S/N 
and lower quality class. The single-measurement blunder 
rates as functions of quality class and measurement method 




Figure 10. The distribution of the redshift differences for the 
blunders (repeat observations with \l\cz\ > 600 km s"-"^) against 
redshift, magnitude, S/N and quality class. The circles are absorp- 
tion redshifts, the squares are emission rcdshifts and the crosses 
are manual redshifts. The histograms at the foot of each panel 
show the fraction of blunders as a function of redshift, magni- 
tude, S/N and quality class. 



are given in Table |l|. We find that Q=3 corresponds to indi- 
vidual redshifts having 90% reliability, and that Q=4 and 5 
correspond to better than 99% reliability, as intended. The 
blunder rate for Q=2 redshifts is 70%, consistent with the 
definition of Q=2 as 'possible but doubtful'. The blunder 
rates for the absorption and emission redshifts are less than 
1%, but the blunder rate for manual redshifts is 9% (due to 
the fact that manual redshifts are only used in the 2.4% of 
cases where the automatic methods fail). 

The distribution of blunders shows a slight peak around 
zero redshift difi'erence, suggesting that the tail of the error 
distribution for correct redshift identifications is more ex- 
tended than a Gaussian. Some of the blunders may thus be 
correct identifications with larger than normal errors, im- 
plying that we may be slightly over-estimating the blunder 
rate. However, the most notable feature in the distribution 
of blunders is the large number of instances (22% of the to- 
tal) where one or other of the redshifts in the comparison 
is approximately zero. The redshift difference is defined as 
(other redshift — best redshift), so the upper envelope cor- 
responds to objects where the best redshift is zero and the 
lower envelope corresponds to the other redshift being zero. 
In the former case the best redshift indicates that the ob- 
jects are stars and the other (non-zero) redshift is due to a 
blunder based on a poor-quality spectrum; in the latter case 
we have the reverse of this situation. There are 3 times as 
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Table 2. Comparisons of redshifts with external sources. 



Source 


comp 


A^600 


(Acz) 


a(Acz) 








km s ^ 


km s"-"- 


CfA2 


87 


1 


-22 


93 


SAPM 


228 


10 


+11 


123 


PSCz 


172 


7 


-11 


120 


LCRS 


3135 


11 


-1 


109 


ZCAT 


1593 


103 


-33 


127 



many objects in the upper envelope as the lower, so stars 
being mistaken for galaxies is a much more common error 
than galaxies being mistaken for stars — which is remarkable 
given that galaxies are 20 times more common than stars in 
our sample. 

We have repeated the above analyses after separating 
the repeat observations into those cases where the repeat is 
due to re-observing a field (12%) and those where it is due to 
observing an object in overlapping fields (88%). We find no 
significant difference in either the single-measurement rms 
precision or the blunder rate for these two subsets of repeat 
measurements. 

Examination of the individual object spectra in the 
cases where there is a blunder shows that in general the 
best redshift appears to be quite reliable and that the main 
reason for the discrepancy is that the other redshift mea- 
surement has been obtained from a poor-quality (low-Q) 
spectrum. 



7.4 External comparisons 

In order to compare our redshift measurements with those 
from other sources, we have matched our catalogue to 
the redshift catalogues from the second Center for As- 
trophysics redshift survey (CfA2; Huchra et al. 1999), the 
Stromlo-APM redshift survey (SAPM; Loveday et al. 1996), 
the PSCz survey (PSCz; Saunders et al. 2000), the Las 
Campanas redshift survey (LCRS; Shectman et al. 1996), 
and also to the heterogeneous redshift compilation of the 
ZCAT catalogue (version November 13, 2000; J. P. Huchra, 
priv.comm.). Objects in these catalogues were matched to 
objects in the 2dFGRS by position; positions separated by 
4 arcsec or less were assumed to belong to the same object. 
This criterion was chosen to include most genuine matches 
and exclude most false matches between close but unrelated 
objects. 

Table |2| and Figure |l^ summarise the results of these 
comparisons. The table lists, for each external source cat- 
alogue: the number of matched objects in the comparison. 



A^ con 



the number of cases, 



where the redshift dif- 



ference between the 2dFGRS and the other source indicates 
a blunder (|Ac2| > 600kms~^); the median redshift differ- 
ence, (Acz); and the rms scatter of Acz about this median, 
a {Acz). The median and rms scatter are estimated exclud- 
ing the blunders. 

In interpreting these comparisons we need to keep in 
mind the magnitude range of the objects that are in com- 
mon. Figure ^ shows the magnitude distributions of the 
comparison samples, all of which are significantly brighter 
than the 2dFGRS itself. The CfA2, SAPM and PSCz cat- 
alogues only overlap with the bright tail, while the median 
magnitude of the LCRS is a magnitude brighter than that 
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Figure 11. Comparisons of the 2dFGRS redshifts with those 
from the CfA2, SAPM, LCRS and ZCAT catalogues. The left 
panels shown the correlations between the redshift measurements, 
with each comparison shown as a dot. Most dots lie along the 
1-to-l line; the relatively few blunders (|Ac2| > 600kms~^) are 
circled. The right panels show the histogram of redshift differences 
overlaid with a Gaussian having the same median and rms. 



of the 2dFGRS. ZCAT has a broader magnitude range, but 
is also heavily weighted towards brighter objects. 

The best comparison is with the LCRS: the compari- 
son sample is the largest, the magnitude range most simi- 
lar, and the catalogue is homogeneous (unlike ZCAT). The 
rms redshift difference of 109 kms^^ is consistent with the 
single-measurement uncertainty of 76kms~^ for objects in 
the 2dFGRS brighter than bj = 18.5 and the typical uncer- 
tainty of 67kms~^ on the LCRS redshifts (Shectman et al. 
1996). The pair- wise blunder rate in the comparison sample 
is 11/3135 = 0.4%, which in fact is rather lower than might 
be expected from the pair- wise blunder rate of 46/2920 — 
1.6% for repeat measurements of objects in the 2dFGRS 
brighter than b.] = 18.5; this may be due to the high surface 
brightness selection criterion of the LCRS. 

The worst comparison is with ZCAT: the rms redshift 
difference is 127kms~^ and the pair-wise blunder rate is 
103/1593 — 6.5%, which is more than twice the pair-wise 
blunder rate for the 2dFGRS repeats. Given the single- 
measurement blunder rate for the 2dFGRS is 1.6%, the im- 
plied single-measurement blunder rate for ZCAT is 5.0%, 
which is presumably due to the heterogeneous nature of its 
source catalogues. 

Although the samples in common with CfA2, SAPM 
and PSCz are relatively small, they give results that are 
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Figure 12. The magnitude distributions of tlie 2dFGRS sample, 
the repeat observations in the 2dFGRS sample, and the five sam- 
ples of objects in common with other redshift catalogues: CfA2, 
SAPM, PSCz, LCRS and ZCAT. 



consistent with the rms redshift errors and pair- wise blunder 
rate obtained from the 2dFGRS repeat observations. 

Checking the 2dFGRS spectra of the objects that have 
redshift blunders, we find that 0/1 cases in the compari- 
son with CfA2, 1/10 for SAPM, 1/7 for PSCz and 4/11 for 
LCRS might possibly be attributed to errors in the 2dFGRS 
redshift estimates. Examining the DSS images of these ob- 
jects, we find that 2 of the discrepancies with SAPM, 1 of 
those with PSCz, and 6 of those with LCRS, occur in cases 
where there is a close companion to the target object and 
hence some potential for confusion. 



8 SURVEY MASKS 

For accurate statistical analysis of the 2dFGRS it is essen- 
tial to fully understand the criteria that define its parent 
photometric catalogue and also the spatial and magnitude- 
dependent completeness of the redshift catalogue. For this 
purpose we have defined three maps or masks characterizing 
this information as a function of position on the sky: 

(1) The magnitude limit mask gives the extinction- 
corrected magnitude limit of the survey at each position. 

(2) The redshift completeness mask gives the fraction 
of measured redshifts at each position. 

(3) The magnitude completeness mask gives a parame- 
ter defining how the redshift success rate depends on appar- 
ent magnitude. 

Each mask has its own use, but for some analyses it is 
necessary to make use of two or even all three masks. We 
now describe in more detail how each one of these masks is 
defined and briefly outline some of their uses. 

8.1 Magnitude limit mask 

Although the 2dFGRS sample was originally selected to have 
a uniform extinction-corrected magnitude limit of b.7 = 19.45, 
in fact the survey magnitude limit varies slightly with po- 
sition on the sky. There are two reasons for this. First, the 
photometric calibrations now available are much more ex- 
tensive than when the parent 2dFGRS catalogue was origi- 
nally defined. This has enabled us to recalibrate the whole 
2dFGRS parent catalogue (Maddox et al. 2001, in prepa- 
ration), and results in new zero-point offsets and linearity 
corrections for each of the UKST photographic plates. Sec- 
ond, the extinction corrections have been changed to use the 
final published version of the Schlegel et al. (1998) extinc- 
tion maps; the original extinction corrections came from a 
preliminary version of those maps. 

The magnitude limit mask is therefore defined by the 
change in the photometric calibration of each UKST pho- 
tographic plate and the change in the dust extinction cor- 
rection at each position on the sky. The magnitude limit 
masks for the NGP and SGP strips are shown in Figure |l3| 
note that the mask also accounts for the holes in the source 
catalogue around bright stars and plate flaws. 

In the SGP, which is a subset of the APM galaxy sur- 
vey (Maddox et al. 1990a,b,c), the rms change in plate zero- 
point is only 0.03 mag. However, in the NGP region the orig- 
inal calibration was less accurate and the change in zero- 
points have an rms of 0.08 mag. The change in the dust 
corrections are also less in the SGP, as the extinction is gen- 
erally lower in this region. In the SGP the rms magnitude 
change due to improved dust corrections is 0.01 mag while 
in the NGP it is 0.02 mag. 

The magnitude limit distribution over the NGP and 
SGP strips is shown in Figure |l4| In the SGP the median 
limiting magnitude is bj=19.40 with an rms about this value 
of 0.05 mag; in the NGP the median limiting magnitude is 
bj=19.35 with an rms of 0.11 mag. 

For accurate statistical analysis of the 2dF survey the 
magnitude limits defined by this mask should be used. It 
is always possible to analyse the data with a fixed magni- 
tude limit if one is prepared to omit both the areas of the 
survey that have magnitude limits brighter than the chosen 
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Figure 13. The magnitude limit masks for the NGP strip (top) and the SGP strip (bottom), plotted in a zenithal equal area projection. 
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Figure 14. The fraction of the sky in the NGP and SGP survey 
strips where the survey limit is fainter than a given bj magnitude. 



limit and also all the galaxies in the remaining areas with 
magnitudes fainter than the chosen limit. 



8.2 Simple redshift completeness mask 

The best way to define a redshift completeness mask is to 
make use of the geometry defined by the complete set of 2° 
fields that were used to tile the survey region for spectro- 
scopic observations. Each region of the sky inside the survey 
boundary is covered by at least one 2° field, but more often 
by several overlapping fields. We define a sector as the region 
delimited by a unique set of overlapping 2° fields. This is the 
most natural way of partitioning the sky, as it takes account 
of the geometry imposed by the pattern of 2° fields and the 
way in which the galaxies were targeted for spectroscopic 
observation. Within each sector, 9, we define the redshift 
completeness, R{9), as the ratio of the number of galaxies 
for which redshifts have been obtained, N^{6), to the total 
number of objects contained in the parent catalogue, Np{8): 

Rie) = N.{e)/NA9). (4) 

The redshift completeness of a given sector, R{6), should 
be clearly distinguished from the redshift completeness of 
a given field, cp, since multiple overlapping fields can con- 
tribute to a single sector. 

This simple redshift completeness mask, shown in Fig- 
ure |l5[ can be used to locate regions in which the redshift 
completeness is high. It can also be used as a first step in 




Figure 15. The redshift completeness, R{0) = Nz{0)/Np{9) as a function of position for the 100k public release dataset (2dFGRS data 
from October 1997 to January 2000). The top panel is for the NGP, bottom panel for the SGP, plotted in a zenithal equal area projection. 

either applying weights to statistically correct for incom- 
pleteness or to construct a random unclustered catalogue 
(for use in estimating correlation functions) that have the 
same angular pattern of incompleteness as the redshift sam- 
ple. For this latter purpose one should also take account of 
how the redshift completeness depends on position within 
a sector as a result of constraints on fibre positioning and 
other considerations. This is best done by using the parent 
catalogue to derive weights for each galaxy with a measured 
redshift (Norberg et al. 2001b, in preparation). Also, as dis- 
cussed in the next section, one should take account of how 
the redshift completeness depends on apparent magnitude. 

8.3 Magnitude completeness mask 

The success rate of measuring redshifts is generally very 
high. Fields for which the field completeness, cf, is less than 
70% are re-observed, while of the remainder just over 76% 
have a completeness greater than 90%. As one approaches 
the magnitude limit of the survey it becomes increasingly 
difficult to obtain good-quality spectra from which reliable 
redshifts can be measured. Hence the success of measuring 
redshifts (the redshift completeness) is a function of appar- 
ent magnitude. 

In Figure |l^ we show the redshift completeness as a 
function of apparent magnitude for four different intervals of 
field completeness, cp. We see that in all cases the complete- 
ness is a function of apparent magnitude, and that the mag- 



nitude at which the completeness begins to drop is brightest 
in the fields with lowest cp, which are generally those taken 
in marginal observing conditions. Overall, the magnitude- 
dependent incompleteness is well modelled by a function of 
the form 

c^{m, ^i) = ■y{l - exp{m - ^i)) (5) 

with 7 = 0.99 and the parameter fj, depending on cp. For 
each field we have chosen to fix /j. by combining equa- 
tion ^ with a simple power-law model for the galaxy num- 
ber counts, N{in) oc exp(am), so that cp and p are related 
by 

J™^ N{m)cz{m, fj,) dm 

cf(m) = r^2 .r, . , ' (6) 

Jmi ^{^jdm 

where mi and m2 are the bright and faint magnitude limits. 
This integral can be evaluated to give 

cf(m) =7(1-5(0) exp(mi -/i)), (7) 
where 

5(a) = ^ e^P[(" + l)("^^-';^i)l-l (8) 
^ ' Q-l-1 exp [a(m2 - mi)] - 1 ^ ' 

and we take q=0.5 as appropriate for the galaxy number 
counts around bj = 19. This equation can be inverted, yield- 
ing 
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been assigned to this sector. For ease in calculating equa- 
tion ^) we tabulate Np{e)/Ne{e) as well as fi{e) and R{e), 
using, for convenience, a pixellated mask rather than the 
exact sectors. 



9 SURVEY DATABASE 

The 2dFGRS database consists of three main components: 
(i) a collection of FITS files, one per object in the source 
catalogue, that contains every piece of information about 
each object from the source catalogue, spectroscopic ob- 
servations and subsequent analysis; (ii) an mSQL database 
(Jepson & Hughes 1998), that contains all of the parameters 
for each object and allows complex searching and subsetting 
of the survey objects and the retrieval of selected subsets of 
spectra and images; and (iii) a WWW interface, that pro- 
vides a number of different modes for querying the mSQL 
database and a variety of ways for returning the results of 
such queries. The next three subsections describe these three 
database components. 



Figure 16. Redshift completeness as a function of apparent 
magnitude. The different panels are for fields of different over- 
all field completeness: top left, fields with completeness in the 
range 95% < cp < 100%; top right, fields with 90% < cp < 95%; 
bottom left, fields with 85% < cp < 90%; and bottom right, fields 
with 80% < Cp < 85%. Note that over 76% of the observed fields 
fall in the first two bins, with cp > 90%. The dashed curves are 
one-parameter model fits (sec text). 
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g{a) exp(rni) 



(1 - cp/7) 



(9) 



which enables fi to be computed for each observed field. 

Our goal is to define the value of n characterising the 
magnitude-dependent completeness for each position in the 
sky. Since the 2° fields overlap it again makes sense to de- 
fine ij.{9) for each sector 9. The value of fi{9) is defined by an 
appropriately- weighted average of the n values of the Nf{9) 
overlapping observed fields comprising the sector. Specifi- 
cally we take 
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'Np(e) 



fi exp(-/ii) 



(10) 



where fi is the fraction of the observed galaxies in this sector 
that were targeted in field i. 

Finally, it is straightforward to combine the magnitude 
completeness given by fj.{9) with the redshift completeness 
R{9) to define an estimate of redshift completeness that de- 
pends on both position (sector) and magnitude: 



S{9,m)^^^Ri9)c,{m,^{9)). 



(11) 



In the first factor, Np{6) is again the number of parent cat- 
alogue galaxies in this sector while 



Np(e) 

N49) = c,im„fii9)) 



(12) 



is an estimate of the number of galaxies one would expect 
to have measured redshifts for given the value of /i that has 



9.1 Object FITS files 

There are 467 214 target objects in the FITS database, each 
with its own FITS file. This number is much larger than 
the number of objects surveyed, since it includes objects 
down to bj=19.6 in the NGP and SGP strips. Each object in 
the survey source catalogue has been given a serial number 
(SEQNUM), and the name of the FITS file for that object is 
this serial number. The serial numbers for objects in the 
SGP strip are 1-193550 and 416694-467214 for objects in 
the NGP strip 193551-332694 and 389714-416693, and for 
objects in the random fields 332695-389713. 

Each FITS file has a primary part which contains all the 
source catalogue data about the object (as FITS keywords) 
and a DSS postage stamp image of the object. Spectra are 
appended as additional FITS extensions. Each spectrum ex- 
tension contains the spectrum of the object, the variance 
(error) array for the object spectrum, the spectrum of the 
mean sky that was subtracted from the object spectrum, 
and FITS keyword data giving information about the spec- 
troscopic observation and derived parameters such as the 
redshift and spectral quality. Many targets contain multiple 
spectrum extensions corresponding to multiple observations. 

Table ^ lists all the FITS keywords present in the pri- 
mary part (extension 0) of the FITS files; Table ^ lists all the 
keywords present in the spectrum extensions (1. . .spectra). 
The tables give the names of the keywords, example values, 
and the keyword definitions. 



9.2 mSQL parameter database 

The mSQL database can be thought of as a table. The rows 
of the table are labelled by the unique object serial number 
(SEQNUM) and the extension number (extnum), with multi- 
ple rows for each target object corresponding to each ex- 
tension in the object's FITS file. The columns of the table 
correspond to the object parameters, and are labelled by 
the name of the corresponding keyword. The object serial 
numbers (SEQNUM) provide the primary database key, but 
the objects are also indexed by their unique survey name 
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Table 3. FITS keywords for the primary image (extension 0) 



Keyword 




Example 


Definition 


SIMPLE 




T 


file does conform to FITS standard 


BITPIX 




16 


number of bits per data pixel 


NAXIS 




2 


number of data axes 


NAXISl 




49 


length of data axis 1 


NAXIS2 




49 


length of data axis 2 


EXTEND 




T 


FITS dataset may contain extensions 


BSCALE 




1.0000 


REAL = (FITS X BSCALE) + BZERO 


BZERO 




0.0000 


zeropoint of conversion to REAL 


SEQITOM 




100100 


Serial number : database primary key 


NAME 


'TGS469Z164' 


2dFGRS assigned name 


IMAGE 




'SKYCHART' 


Existence of postage stamp image 


RA 





.7943429758 


RA (B1950) in radians : 3 2 3.00 


DEC 


-0 


.6476286941 


DEC (B1960) in radians : -31 22 16.9 


EQUINOX 




1960.00 


equinox of RA and DEC 


BJSEL 




18.93 


final bj mag used in the object selection 


PROB 




2336.4 


psi star-galaxy classification parameter 


PARK 




0.910 


k star-galaxy classification parameter 


PARMU 




0.187 


mu star-galaxy classification parameter 


IGAL 




1 


final classification flag (1 for a galaxy) 


JON 




-1 


eyeball classification flag 


ORIENT 




91.0 


orientation in degrees clockwise from E-W 


ECCENT 




0.270 


eccentricity 


AREA 




308.0 


isophotal area in pixels 


X_BJ 




2918.7 


plate l^bj rn 8 micron pixels 


Y_BJ 




9123.1 


plate j/bj ^ micron pixels 


DX 




43.0 


corrected difference 100(xbj — Xfi) 


DY 




49.0 


corrected difference 100(j/]3j — yfi) 


BJG 




18.99 


bj without extinction correction 


RMAG 




10.35 


unmatched APM 'total' mag 


PMAG 




10.53 


unmatched raw APM profile integrated mag 


FMAG 




8.72 


unmatched raw APM 2" 'fibre' mag 


SMAG 




10.74 


unmatched raw stellar mag (from APMCAL) 


IFIELD 




417 


UKST field 


IGFIELD 




2007 


galaxy number in UKST field 


REGION 




'3417 ' 


GSSS region name 


OBJEQNX 




2000.00 


equinox of the plate reference frame 


OBJRA 





.8034094622 


RA (J2000) in radians : 03 04 07.673 


OBJDEC 


-0 


.5441390223 


DEC CJ2000) in radians : -31 10 36.73 


PLTSCALE 




67 . 2000 


Plate scale in arcsec per mm 


XPIXELSZ 




25 . 2844500 


X pixel size in microns 


YPIXELSZ 




25.2844500 


Y pixel size in microns 


OBJPLTX 




7970.86 


object X on plate (pixels) 


OBJPLTY 




4148.11 


object Y on plate (pixels) 


DATAMAX 




14431 


Maximum data value 


DATAMIN 




4011 


Minimum data value 


BJSELOLD 




18.96 


original bj mag used in the object selection 


BJG.OLD 




19.01 


original bj without extinction correction 


END 






End of FITS header 



(NAME), which has the format TGhfffZnnn, whore h is the 
hemisphere (N for the NGP strip and S for the SGP strip 
and random fields), fff is the number of the primary field 
to which the object is assigned and imn is the number of the 
galaxy within that field. Note that the observed name of the 
object (parameter OBSNAME in each spectrum extension) is 
the same as NAME except that: (i) if the field in which the 
object is observed (given by DBSFLD) is an overlapping field 
rather than its primary field (given by fff), then the first 
character of the name is changed from T to X; and (ii) if 
the object has been flagged as a possible merger, then the 
second character of the name is changed from G to M. 

The first row for each object (extnvmi=0) contains the 
source catalogue data and the basic spectroscopic informer 



Table 4. FITS keywords for the spectra (extensions 1. . .spectra) 



Keyword 


Example 


Definition 


XTENSION 


'IMAGE ' 


IMAGE extension 


BITPIX 


-32 


number of bits per data pixel 


NAXIS 


2 


number of data axes 


NAXISl 


1024 


length of data axis 1 


NAXIS2 


3 


length of data axis 2 


PCOUNT 





required keyword; must = 


GCOUNT 


1 


required keyword; must = 1 


CRVALl 


5802 . 8979492 


coordinate value of axis 1 


CDELTl 


4.3103027344 


coordinate increment on axis 1 


CRPIXl 


612.0000000000 


reference pixel on axis 1 


CUNITl 


' Angstroms ' 


units for axis 1 


EXTNAME 


'SPECTRUM' 


2dFGRS spectrum 


OBSNAME 


'TGS469Z164' 


observed object name 


OBSRA 


0.7943429758 


observed RA CB1960) in radians 


DBSDEC 


-0.6475286940 


observed DEC CB1950) in radians 


MATCH.DR 


0.0000 


position match error in arcsec 


Z 


0.178876 


raw measured redshift 


Z.HELIO 


0.178860 


heliocentric redshift 


QUALITY 


6 


redshift measurement quality 


ABEMMA 


1 


redshift type: abs=l,emi=2,man=3 


NMBEST 





# emission lines for emission z 


NGOOD 





number of good emission lines 


Z.EMI 


-9.9990 


emission redshift 


I3.Z_EMI 





emission redshift quality 


KBESTR 


2 


cross-correlation template 


R.CRCCR 


15.5600 


cross-correlation peak 


Z.ABS 


. 1789 


cross-correlation redshift 


Q.ZJBS 


3 


cross-correlation quality 


Q.FINAL 


3 


suggested quality for redshift 


lALTER 





IALTER=1 if automatic z altered 


Z.COMM 




observer ' s comment 


THPUT 


0.96613 


fibre throughput 


SPFILE 'sj 


5p469.991104.1z . f its ' 


2dF reduced data file 


PLATE 


1 


2dF plate number 


PIVOT 


302 


2dF pivot number 


FIBRE 


58 


2dF fibre number 


OBSRHN 


'99DCT ' 


observation run 


GRS.DATE 


'991104 ■ 


2dF YYMMDD observed date 


UTDATE 


'1999:11:04' 


UT date of observation 


SPECTID 


'A ' 


2dF spectrograph ID 


GRATID 


'300B ' 


2dF grating ID 


GRATLPMM 


300 


2dF grating line per mm 


GRATBLAZ 


'COLLIMATOR' 


2dF grating blaze direction 


GRATANGL 


25 . 30000 


2dF grating angle 


LAMBDAC 


5782 . 700 


central wavelength 


CCD 


'TEKTR0NIX.5' 


CCD ID 


CCDGAin 


2.790 


CCD inverse gain Ce per ADU) 


CCDNOISE 


6.200 


CCD read noise (electrons) 


OBJX 


196833 


2dF object X position 


OBJY 


10401 


2dF object Y position 


OBJXERR 


6 


2dF object X position error 


OBJYERR 


14 


2dF object Y position error 


OBJMAG 


18.96 


2dF object magnitude 


THETA 


4.526 


2dF fibre angle 


PTRTYPE 


'P ' 


2dF ptrtype 


PID 





2dF pid 


DBSFLD 


'sgp469 ' 


2dF observed field number 


NCCMB 


3 


number of frames combined 


REFRUN 


31 


AAT run number of reference 


UTSTART 


'16:37:69.48' 


UT start of reference exposure 


UTEND 


'16:67:69' 


UT end of reference exposure 


REFEXP 


1200.0 


reference run exposure (sees) 


REFHASTA 


'36.07264' 


HA start of reference exposure 


REFHAEND 


'41.08119' 


HA end of reference exposure 


ETA_TYPE 


-2. 593450 lE+00 


eta spectral type parameter 


SNR 


2.0299999E+01 


median S/N per pixel 
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Table 5. Additional mSQL database keywords 
(i) Keywords in all extensions 
Keyword Example Definition 



serial 

name 

UKST 

spectra 

extnum 

obsrun 

TDFgg 

pivot 

plate 

fiber 

z 

zjielio 
abemma 
quality 



100100 2dFGRS serial number 
TGS469Z164 2dFGRS name 

417 UKST sky survey field number 

1 number of spectra for this object 
1 extension number 
990CT observing run year and month 
-469 2dFGRS field number C+NGP, -SGP) 
302 2dF pivot 
1 2dF plate 
2dF fibre 
observed redshift 
heliocentric redshift 
redshift type Cabs=l , emi=2 , man=3) 
redshift quality parameter 



58 

0.178876 
. 178860 



(i) Keywords in extension only 


Keyword 


Example 


Definition 


alpha 


0.7943429758 


RA CB1950) in radians 


delta 


-0.5475286941 


DEC CB1950) in radians 


ra 


3 2 3.00 


RA CB1950) in HH MM 33. 33 


dec 


-31 22 15.9 


DEC (B1950) in DD HH 33.3 


ra2000 


03 04 07.68 


RA (J2000) in HH MM 33.33 


dec2000 


-31 10 36.8 


DEC (J2000) in DD HH 33.3 


12 


228.9258834424 


Galactic longitude 


b2 


-60.8572447739 


Galactic latitude 



tion from the best spectrum of that object. The keywords 
for that row are the FITS parameters for the primary image 
(Table ^) plus all the additional keywords listed in Table ^ 
The best spectrum is the one with the highest redshift qual- 
ity parameter; if there is more than one spectrum of the same 
quality, then the most recent of these spectra is used. Subse- 
quent rows for the same object (extnum=l. . .spectra, where 
spectra is the number of spectra obtained for that object) 
contain the FITS parameters pertaining to each spectro- 
scopic observation (Table ^ plus the additional keywords 
in section (i) of Table ^ If there is no spectrum for the 
object then spectra=0 and only the row corresponding to 
extnum=0 will exist. Note that some information is dupli- 
cated between rows and that not all parameters are defined 
for all rows; undefined parameters return a NULL value. 

Searches of the database use the mSQL query format 
(Jepson & Hughes 1998), which has the basic format 
SELECT list_of .parameters FROM database_name WHERE 
list_of .conditions 

and list_of .conditions is a series of equalities and 
inequalities linked by Boolean relations. 

An example is 
SELECT name, extnum, ra, dec, BJSEL, Z, QUALITY, 
z, quality FROM public WHERE name='TGS469Z164' 
which selects the listed parameters for the object with 2dF- 
GRS name TGS469Z164 (note the single quotes around the 
character string) from both the summary row (extnum=0) 
and for each spectrum (extnum=l. . .spectra). Note that 
the parameters with the same name in lower case and upper 
case are distinct: the former are generally from extnum=0, 
the latter from extnuni>0 (parameters are returned as NULL 
in rows where they are not defined). 



An example with a more complex list of conditions is 
SELECT name FROM public WHERE extnum=0 AND 
((BJSEL<15.5 AND quality>=3) OR quality>4) 
which lists just the names of the objects which are either 
brighter than b,i=15.5 with redshift quality at least 3, or 
have quality greater than 4, or both; the search is restricted 
just to the summary row by requiring extnum=0. 

Simple searches on the two indexed parameters, 
serial and name), are quick — e.g. WHERE serial=69656 
or WHERE name='TGS203Z081 ' ; more complex searches take 
about 5 minutes. Further information about the mSQL 
database software and its structured query language is 
given in Yarger et al. (1 999) and on the WWW at 



http:/ /www, hughes, com.au 



9.3 WWW interface 

The 2dFGRS mSQL database can be searched via the 
WWW interface in a number of ways: (i) search via a stan- 
dard mSQL query as described above — this is the most gen- 
eral method; (ii) perform a standard mSQL query restricted 
to a list of named objects; (iii) perform a standard mSQL 
query restricted to objects within a specified radius of a 
given position on the sky; (iv) match objects to a supplied 
catalogue of positions. 

The results of a query can be returned in several forms: 
as an HTML table, as an ASCII table, as a gzipped ASCII 
file, as an email giving the URL of a gzipped ASCII file 
created with a background job, or as a tar file containing 
the FITS files for the selected objects. The HTML table has 
buttons allowing you to select a particular object and view 
the postage stamp DSS image and all the observed spectra 
for the object; if the spectra have measured redshifts, then 
the positions of prominent spectral features are indicated at 
the redshift associated with each individual spectrum. The 
FITS files for the objects in the HTML table can also be 
bundled up into a compressed tar file which can then be 
down loaded by anonymous ftp. 

Full instructions for accessing the database are given on 



the survey website at http:/ /www. mso.anu.edu.au/2dFGRS 



and at mirror sites given there. 



10 RESULTS 



The redshift distribution of the galaxies in the survey is 
shown in Figure ^ Also shown is a simple analytic approx- 
imation to the redshift distribution, following Efstathiou & 
Moody (2001), with the form 



dN oc 2 exp 



1.36z\i-55 



dz 



(13) 



where z—Q.ll is the median redshift of the survey. 

The spatial distribution of the galaxies in the survey 
strips is shown in Figure |l^. This figure is the projection 
of the full width of the strips (10° in the NGP and 15° in 
the SGP). As not all fields have yet been observed, there 
are variations in the effective thicknesses of the strips in the 
directions of the missing fields that produce corresponding 
variations in the galaxy density. The clustering of galaxies 
revealed in this figure has been investigated by Peacock et al. 
(2001), Percival et al. (2001) and Norberg et al. (2001a). 
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Figure 17. The redshift distribution of the 2dFGRS (histogram) 
and a simple smooth analytic approximation (curve). 



The 2dFGRS has provided the first clear detection of 
the redshift-space clustering anisotropy on large scales that 
is a key prediction of the gravitational instability paradigm 
for the growth of structure in the universe (Peacock et al. 
2001). Measurements of this distortion yield a precise esti- 
mate of the parameter (3 = /h = 0.43 ± 0.07, where Q. 
is the total mass density of the universe and b is the lin- 
ear bias of the galaxies with respect to the mass. Combined 
with recent measurements of the anisotropics in the cosmic 
microwave background (Jaffe et al. 2001), this result favours 
a low-density universe, with r2«0.3. 

The power spectrum of the galaxy distribution has been 
determined from the survey using a direct FFT-based tech- 
nique (Percival et al. 2001). Over the range in wavenum- 
ber 0.02<fc<0.15 /iMpc"^, the shape of the observed power 
spectrum will be close to that of the linear density perturba- 
tions convolved with the window function of the survey. Fit- 
ting convolved power spectrum models constrains the shape 
parameter P = Qh to be 0.20 ± 0.03 (68% confidence level) 
and shows that models containing baryon oscillations are 
preferred over models without baryons at the 95% confidence 
level. This is the first detection of baryon oscillations in the 
galaxy distribution, and yields an estimate for the baryon 
fraction Qb/^^m. = 0.15 ± 0.07, assuming scale-invariant pri- 
mordial fiuctuations. 

The size of the 2dFGRS sample has allowed investiga- 
tion of the variation in the strength of galaxy clustering with 
luminosity, using the projected two-point correlation func- 
tion of galaxies in a series of volume-limited samples (Nor- 
berg et al. 2001a). The clustering of L* (M^^ - 5 log /i = 
— 19.7) galaxies in real space is well fitted by a power-law 
relation with exponent 7 — 1.71 ± 0.06 and a correlation 
length rg — (4.9 ± 0.3) Mpc. The exponent shows very 
Httle variation for galaxy samples with luminosities differ- 
ing by a factor of 40. The clustering amplitude, however, 
increases with luminosity: slowly for galaxies fainter than 
M* , but more strongly at brighter absolute magnitudes. 



This dependence of the correlation length on luminosity is 
in good agreement with the predictions of the hierarchical 
galaxy formation models of Benson et al. (2001). In terms 
of the bias parameter, the relation is well-represented by 
Sb/b* = 0.155L/L*, where Sb = b- b* , SL = L- L* a.nd the 
relative bias is given by 6/6* = (ro/ro)''^'^ ■ 

The 2dFGRS has also been used to characterise the 
internal properties of the galaxy population. Folkes et al. 
(1999) and Madgwick et al. (2001a) have determined the 
luminosity function for galaxies, both overall and as a func- 
tion of spectral type. Principal Component Analysis has 
been applied to the 2dFGRS spectra, and a linear combina- 
tion of the first two principal components, rj, has been used 
to parametrise the spectral type. Going from early types 
to late types, the luminosity functions appear to exhibit a 
systematic decrease in the characteristic luminosity (from 
Mb J — 5 log 6, = —19.6 to —19.0) and a steepening of the 
faint-end slope (from a = —0.52 to —1.43). However there is 
also evidence that, at the precision afforded by the 2dFGRS 
sample, the standard Schechter fitting function is no longer 
an adequate representation of the luminosity function. 

Figure |l9| shows the redshift slices for each of the four 
spectral types defined in terms of the 77 parameter by Madg- 
wick et al. (2001a): type 1 are the earliest types and type 4 
the latest types. The association of earlier types with clus- 
ters and local density enhancements is apparent, as is the 
tendency for later types to be associated with lower-density 
regions. A quantitative study of the clustering of different 
spectral types will be the subject of a subsequent paper. 

The large 2dFGRS sample also allows the generalisa- 
tion of the luminosity function into the bivariate bright- 
ness distribution (BED) over both luminosity and surface 
brightness (Cross et al. 2001). The BBD derived from the 
2dFGRS shows a strong surface brightness-luminosity rela- 
tion, Mb J oc (2.4±o;5)/ie- The luminosity density is domi- 
nated by normal giant galaxies and the peak of the BBD 
lies away from the survey selection boundaries, implying 
that the 2dFGRS is complete and that luminous low sur- 
face brightness galaxies are rare. By integrating over the 
BBD, the local luminosity density is estimated to be Jb = 
(2.5 ± 0.2) X W^hLQ Mpc-^ 

Cole et al. (2001) have combined the 2dFGRS with the 
2MASS extended source catalogue to produce an infrared- 
selected sample of over 17000 galaxies with redshifts. This 
sample has been used to determine the J and T^s-band 
galaxy luminosity functions. The luminosity functions (in 
2MASS Kron magnitudes) are fairly well fit by Schechter 
functions: in J with M} - 5 log /i = -22.36 ± 0.02, aj = 
-0.93 ± 0.04, = 0.0104 ± 0.0016 /i^ Mpc-^ and in Ks 
by - 51og/i = -23.44 ± 0.03, qa' = -0.96 ± 0.05, 
$A = 0.0108 ± 0.0016 Mpc~^. From the distributions of 
B—K and J—K colours with absolute magnitude and mod- 
els of the stellar populations, the galaxy stellar-mass func- 
tion can be estimated. Integrated over all galaxy masses, it 
yields a total mass fraction in stars (in units of the critical 
density) of flstars/i = (1.6±0.2) x 10"^ for a Kennicutt IMF, 
and fistars/i = (2.9 ± 0.4) x 10"^ for a Salpeter IMF. These 
values are consistent with estimates based on the time inte- 
gral of the observed star formation history of the universe 
only if dust extinction corrections at high redshift are mod- 
est. 
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Figure 18. The projected distribution of the galaxies in tlic NGP (top) and SGP (bottom) strips, as a function of redshift and R.A.; 
the variations in the galaxy density with R.A. are due to variations in the effective widths of the strips. 




Figure 19. Redshift slices for different spectral types: type 1 corresponds to E/SO, type 2 to Sa/Sb, type 3 to Sc/Sd and type 4 to Irr. 
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Figure 19. (ctd) 
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The 2dFGRS can add significant value to all-sky imag- 
ing surveys at all wavelengths, as in the studies of the radio 
galaxy population combining redshifts and optical spectra 
from the 2dFGRS with the NVSS radio survey (Sadler et al. 
2001) and the FIRST radio survey (Magliocchetti et al. 
2001). Using 20% of the fuU 2dFGRS area, Sadler et al. 
(2001) find 757 optical counterparts for NVSS sources — the 
largest and most homogeneous set of radio-source spectra to 
date. These sources range from 2=0.005 to 2=0.438, and are 
a mixture of active galaxies (60%) and star-forming galax- 
ies (40%). The local radio luminosity function at 1.4 GHz 
is determined for both active and star-forming galaxies, and 
yields an estimate for the local star-formation density of 
(2.8 ± 0.5) X 10"^ Mq yr-i Mpc"^ 



of FITS files, one per object, which contain all the param- 
eters and spectra for each object, and a mSQL parame- 
ter database which can be used for sophisticated search- 
ing, matching and sorting of the survey data. The 2dF- 
G RS database is available through the s urvey WWW site 



at 



http:/ /www. mso.anu.cdu.au/2dFGRS 
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11 CONCLUSIONS 

The 2dF Galaxy Redshift Survey (2dFGRS) is designed to 
measure redshifts for approximately 250 000 galaxies. The 
survey uses the 2dF multi-fibre spectrograph on the Anglo- 
Australian Telescope, which is capable of observing up to 
400 objects simultaneously over a 2° diameter field of view. 
The source catalogue for the survey is a revised and ex- 
tended version of the APM galaxy catalogue, which is based 
on Automated Plate Measuring machine (APM) scans of 390 
plates from the UK Schmidt Telescope (UKST) Southern 
Sky Survey. The target galaxies have extinction-corrected 
magnitudes brighter than bj = 19.45, with extinctions de- 
rived from the maps of Schlegel et al. (1998). 

The main survey regions are two contiguous declina- 
tion strips, one in the southern Galactic hemisphere (the 
SGP strip) covering 80° x 15° centred close to the SGP, and 
the other in the northern Galactic hemisphere (the NGP 
strip) covering 75° x 10° . In addition, there are 99 random 
fields are spread uniformly over the entire region of the 
APM catalogue in the southern Galactic hemisphere outside 
the SGP strip. In total the survey covers some 2000 deg^ 
and has a median depth of 2=0.11. Out to the effective 
limit of the survey at 2«0.3, the strips contain a volume of 
1.2x 10* ^^'^ Mpc^; the volume sparsely sampled including 
the random fields is two to three times larger. An adaptive 
tiling algorithm is used to give a highly uniform sampling 
rate of 93% over the whole survey region. 

Spectra are obtained over the wavelength range 3600- 
8OOOA at two-pixel resolution of 9.0A. The median S/N is 
13 pixel" over 4OOO-75OOA. Redshifts are measured both 
by cross-correlation with a range of template spectra and 
by fitting strong spectral features. All redshift identifica- 
tions are visually checked and assigned a quality parameter 
Q in the range 1-5. From repeat measurements and compar- 
isons with other redshift catalogues, we find that redshifts 
with Q=3 have a blunder rate (fraction of incorrect identi- 
fications) of just under 10% and Q=4 and 5 redshifts have 
blunder rates less than 1% — the overall blunder rate for re- 
liable (Q>3) redshifts is 3%. The overall rms uncertainty 
in the Q>3 redshifts is 85kms~^. The completeness of the 
survey is computed as a function of both field and appar- 
ent magnitude. The overall redshift completeness is 91.8%, 
but this varies with magnitude from 99% for the brightest 
galaxies to 90% for objects at the survey limit. 

The survey database has two components: a collection 
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